Abstract. A real valued function $f$ defined on a convex set $K$ is an approximately convex function iff it satisfies
\[ f\left(\frac{x+y}{2}\right) \le \frac{f(x)+f(y)}{2} + 1. \]
A thorough study of approximately convex functions is made. The principal results are a sharp universal upper bound for lower semi-continuous approximately convex functions that vanish on the vertices of a simplex and an explicit description of the unique largest bounded approximately convex function $E$ vanishing on the vertices of a simplex.

A set $A$ in a normed space is an approximately convex set iff for all $a, b \in A$ the distance of the midpoint $(a+b)/2$ to $A$ is $\le 1$. The bounds on approximately convex functions are used to show that in $\mathbf{R}^n$ with the Euclidean norm, for any approximately convex set $A$, any point $z$ of the convex hull of $A$ is at a distance of at most $\lfloor \log_2(n-1) \rfloor + 1 + (n-1)/2^{\lfloor \log_2(n-1) \rfloor}$ from $A$. Examples are given to show this is the sharp bound. Bounds for general norms on $\mathbf{R}^n$ are also given.
1. Introduction

The problem motivating this paper is the following: given a set $A$ in $\mathbf{R}^n$, estimate the size of the convex hull $\mathrm{Co}(A)$ of $A$ in terms of geometric properties of $A$. To do this we assume that $\mathbf{R}^n$ is equipped with a norm $\|\cdot\|$. Then a first step in constructing the convex hull of $A$ is to add all the midpoints of segments joining points of $A$. The size of $\mathrm{Co}(A)$ can be estimated in terms of this first step.

Our main result gives the sharp constants in this estimate for $n$-dimensional Euclidean spaces and it provides an estimate for the constants for general $n$-dimensional normed spaces which is accurate to within $2/n$. (Here $\mathrm{dist}(x, A) := \inf\{\|x - a\| : a \in A\}$ is the distance of the point $x$ from the set $A$.)

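Theorem 1. If $(\mathbf{R}^n, \|\cdot\|)$ is an $n$-dimensional normed linear space then there is a constant $C_{\|\cdot\|}$, depending on the norm $\|\cdot\|$, so that if $A \subset \mathbf{R}^n$ satisfies
\[ a, b \in A \quad\text{implies}\quad \mathrm{dist}\left(\frac{a+b}{2}, A\right) \le \delta \tag{1.1} \]
then
\[ z \in \mathrm{Co}(A) \quad\text{implies}\quad \mathrm{dist}(z, A) \le C_{\|\cdot\|}\,\delta. \]
Letting $\lfloor\cdot\rfloor$ be the greatest integer function, the sharp constant $C_{\|\cdot\|}$ satisfies
\[ \lfloor \log_2(n-1) \rfloor + 1 + \frac{n-1}{2^{\lfloor \log_2(n-1) \rfloor}} \le C_{\|\cdot\|} \le \lfloor \log_2 n \rfloor + 1 + \frac{n}{2^{\lfloor \log_2 n \rfloor}} \]
(this holds for all norms) and the sharp constant when $\|\cdot\|$ is the Euclidean norm is $C_{\|\cdot\|} = \lfloor \log_2(n-1) \rfloor + 1 + (n-1)/2^{\lfloor \log_2(n-1) \rfloor}$.

The upper bound $C_{\|\cdot\|} \le 2\lceil \log_2(n+1) \rceil$ (where $\lceil\cdot\rceil$ is the ceiling function) is implicit in the paper [?, Props. 3.3 and 3.4] of Casini and Papini.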

For bounded sets this can be given a concise restatement in terms of the Hausdorff distance between sets. Recall that if $A, B \subset \mathbf{R}^n$ are bounded then the Hausdorff distance, $d_H(A, B)$, between $A$ and $B$ is the infimum of the numbers $r$ so that every point of $A$ is within a distance $r$ of a point of $B$ and every point of $B$ is within distance $r$ of a point of $A$. Define numbers $\kappa(n)$ for $n \ge 0$ by $\kappa(0) := 0$ and
\[ \kappa(n) := \lfloor \log_2 n \rfloor + 1 + \frac{n}{2^{\lfloor \log_2 n \rfloor}} \]
for $n \ge 1$. The collection of midpoints of segments joining pairs of points of $A$ is $\tfrac12(A + A) = \{(a+b)/2 : a, b \in A\}$. Then Theorem 1 can be restated as
\[ d_H(\mathrm{Co}(A), A) \le C_{\|\cdot\|}\, d_H\left(\tfrac12(A + A), A\right) \]
where the sharp constant $C_{\|\cdot\|}$ satisfies
\[ \kappa(n-1) \le C_{\|\cdot\|} \le \kappa(n) \]
and $C_{\|\cdot\|} = \kappa(n-1)$ when $\|\cdot\|$ is the Euclidean norm. (Allowing $+\infty$ for a value of $d_H(A, B)$ this also holds for unbounded sets.) If $A$ is a finite set with $N$ points, then $d_H(A, \tfrac12(A + A)) = \max_{b, c \in A} \min_{a \in A} \|a - \tfrac12(b + c)\|$, which can be computed in $O(N^3)$ operations. Thus for finite sets Theorem 1 allows estimation of $d_H(A, \mathrm{Co}(A))$ in polynomial time.
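To make the $O(N^3)$ count concrete, here is a minimal Python sketch of the max-min formula for $d_H(A, \tfrac12(A+A))$. It assumes the Euclidean norm and points given as coordinate tuples; the function name `midpoint_hausdorff` and the sample set `square` are illustrative choices, not notation from the paper.

```python
import math
from itertools import combinations_with_replacement


def midpoint_hausdorff(points):
    """Return max over b, c in A of min over a in A of ||a - (b + c)/2||.

    Uses the Euclidean norm. There are O(N^2) midpoints and each needs an
    O(N) nearest-point scan, giving O(N^3) distance evaluations.
    """
    worst = 0.0
    for b, c in combinations_with_replacement(points, 2):
        mid = [(bi + ci) / 2 for bi, ci in zip(b, c)]
        nearest = min(math.dist(a, mid) for a in points)
        worst = max(worst, nearest)
    return worst


# Hypothetical example: the four corners of a square of side 2.
square = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
print(midpoint_hausdorff(square))
```

For the square the worst midpoint is the center, so multiplying the printed value by the constant of Theorem 1 gives an upper estimate for how far the convex hull can be from the four corners.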

For general norms obtaining the lower bound $\kappa(n-1) \le C_{\|\cdot\|}$ is more difficult than the upper bound and involves construction of some interesting geometric objects, the extremal approximately convex functions. To describe these we first make a couple of definitions. The following is motivated by taking $\delta = 1$ in the hypothesis of Theorem 1.

Definition 1. Let $(X, \|\cdot\|)$ be a normed space. Then a subset $A \subset X$ is an approximately convex set iff for all $a, b \in A$
\[ \mathrm{dist}\left(\tfrac12(a + b), A\right) \le 1. \]
□



If $A$ is an approximately convex set then the function $h(x) = \mathrm{dist}(x, A)$ (the distance of $x$ from $A$) will satisfy a weak form of the inequality satisfied by a convex function. We isolate this property:

Definition 2. Let $E$ be a convex set in the normed space $(X, \|\cdot\|)$. Then a function $h: E \to \mathbf{R}$ is an approximately convex function iff for all $a, b \in E$
\[ h\left(\frac{a+b}{2}\right) \le \frac{h(a) + h(b)}{2} + 1. \]
□

(Strictly speaking this should be "approximately midpoint convex" or "approximately Jensen convex" but for the sake of brevity we will use "approximately convex".) Let $\Delta_n := \{(\alpha_0, \dots, \alpha_n) : \alpha_k \ge 0,\ \sum_{k=0}^n \alpha_k = 1\}$ be the standard $n$-dimensional simplex. Then the result leading to the lower bounds on $C_{\|\cdot\|}$ is the explicit computation of the extremal approximately convex function on the simplex.

Theorem 2. There is an approximately convex $E: \Delta_n \to \mathbf{R}$ which vanishes on the vertices of $\Delta_n$ with the following properties:

1. If $h$ is a bounded (or Borel-measurable) approximately convex function on $\Delta_n$ which takes non-positive values on the vertices, then $h(x) \le E(x)$ for $x \in \Delta_n$.

2. $E$ achieves its maximum value of $\kappa(n)$.

3. $E$ is lower semi-continuous.

The property 1 characterizes $E$ uniquely. Moreover $E$ is given concretely in terms of an elementary infinite sum (see equations (2.15) and (2.21)).

The examples showing that the lower bounds on $C_{\|\cdot\|}$ in Theorem 1 are sharp are constructed from the graph of $E$. The lower semi-continuity of $E$ and the fact that $E$ has $\kappa(n)$ as its maximum are important in these constructions. We note the mere existence of $E$ (which follows from abstract considerations) is less important than the fact that $E$ is given explicitly in a relatively simple form (cf. §2.4 and Figure 2).



We now give a more detailed description of our results. In §2.1 we give upper bounds on approximately convex functions which are locally bounded from above. Motivated by Perron's method in the theory of harmonic functions, in §2.2 we show that given a compact convex set $K \subset \mathbf{R}^n$ with extreme points $V$ and a uniformly continuous function $\varphi: V \to \mathbf{R}$ there is a unique extremal bounded approximately convex function $E_{K,\varphi}$ on $K$ which agrees with $\varphi$ on $V$; moreover, $E_{K,\varphi}$ is realized (as in Perron's method) as the pointwise supremum of all bounded approximately convex functions on $K$ which agree with $\varphi$ on $V$. The function $E_{K,\varphi}$ is lower semi-continuous, characterized by a mean-value property, and satisfies a certain maximum principle.



§2.3 and §2.4 contain a description of the extremal approximately convex function $E$ on the simplex and proofs of the properties of $E$ listed in Theorem 2. In §2.5, we determine the extremal function $E_{K,\varphi}$ when $K$ is a convex polytope. A stability theorem with sharp constants for approximately convex functions of the type first given by Hyers and Ulam [?] is given in §2.6. This states that an approximately convex function can be approximated in the uniform norm by a convex function with error only depending on the dimension of the domain. The example showing the constants are sharp is the extremal function $E$. The rest of Section 2 gives various other properties and examples of approximately convex functions.

Section 3 gives the proof of Theorem 1 and some of its extensions and refinements. The first two sections give the upper and lower bounds $\kappa(n-1) \le C_{\|\cdot\|} \le \kappa(n)$ for general norms. The upper bound follows from the general upper bounds on approximately convex functions and the lower bound uses properties of the extremal approximately convex function $E$ on $\Delta_n$. The proof that $C_{\|\cdot\|} = \kappa(n-1)$ in the Euclidean case is given later in Section 3. This requires some (hopefully interesting) geometrical arguments in addition to Theorem 2. Finally, we prove that $C_{\|\cdot\|} = 2$ for all two-dimensional norms. This argument is somewhat ad hoc and does not appear to extend to higher dimensions.

2. Approximately Convex Functions 

We first relate approximately convex functions to approximately convex 
sets. 
2.1. Proposition. Let $(X, \|\cdot\|)$ be a normed space, $A \subset X$, and define $h(x) := \mathrm{dist}(x, A)$. Then $A$ is an approximately convex set if and only if $h$ is an approximately convex function.

Proof. If $h(x) = \mathrm{dist}(x, A)$ is an approximately convex function it is clear that $A$ is an approximately convex set (as $h$ vanishes on $A$). Conversely if $A$ is an approximately convex set, let $x_0, x_1 \in X$ and $\varepsilon > 0$. Choose $a_0, a_1 \in A$ so that $\|x_0 - a_0\| < h(x_0) + \varepsilon$ and $\|x_1 - a_1\| < h(x_1) + \varepsilon$. As $A$ is approximately convex $\mathrm{dist}((a_0 + a_1)/2, A) \le 1$. Thus
\[ h\left(\frac{x_0 + x_1}{2}\right) \le \left\| \frac{x_0 + x_1}{2} - \frac{a_0 + a_1}{2} \right\| + \mathrm{dist}\left(\frac{a_0 + a_1}{2}, A\right) \le \frac{\|x_0 - a_0\|}{2} + \frac{\|x_1 - a_1\|}{2} + 1 < \frac{h(x_0) + h(x_1)}{2} + 1 + \varepsilon. \]
As $\varepsilon$ was arbitrary this completes the proof. (This proof is implicit in the paper of Casini and Papini [?, Prop. 3.4].) □
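The equivalence in Proposition 2.1 can be spot-checked on the real line. The sketch below is only an illustration under assumed data: the sets `A` and `B` and the sample grid are hypothetical, exact rational arithmetic is used to avoid rounding, and a finite grid can refute but never prove approximate convexity of $h$.

```python
from fractions import Fraction
from itertools import product


def dist(x, pts):
    """Distance from the real number x to the finite set pts."""
    return min(abs(x - a) for a in pts)


def is_approx_convex_set(pts):
    """Definition 1: every midpoint of two points lies within 1 of the set."""
    return all(dist((a + b) / 2, pts) <= 1 for a, b in product(pts, repeat=2))


def is_approx_convex_on(grid, h):
    """Check h((x+y)/2) <= (h(x)+h(y))/2 + 1 for all pairs of grid points."""
    return all(h((x + y) / 2) <= (h(x) + h(y)) / 2 + 1
               for x, y in product(grid, repeat=2))


A = [Fraction(0), Fraction(2), Fraction(4)]   # midpoints are within 1 of A
B = [Fraction(0), Fraction(4)]                # the midpoint 2 is at distance 2
grid = [Fraction(k, 4) for k in range(-12, 29)]

print(is_approx_convex_set(A), is_approx_convex_on(grid, lambda x: dist(x, A)))
print(is_approx_convex_set(B), is_approx_convex_on(grid, lambda x: dist(x, B)))
```

Both tests agree for each set, as the proposition predicts: the pair $x = 0$, $y = 4$ already witnesses the failure for `B`.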

2.1. Bounds on approximately convex functions. The first bound is an extension to approximately convex functions of a standard result about convex functions.

2.2. Proposition. Let $U \subset \mathbf{R}^n$ be a convex set and $h: U \to \mathbf{R}$ be approximately convex and bounded from above by $C$. Then for any $x_0 \in U$ and $x \in U \cap (2x_0 - U)$ (if $x_0$ is in the interior of $U$ this is a neighborhood of $x_0$ in $U$) the inequality
\[ h(x) \ge 2h(x_0) - C - 2 \]
holds, and so $h$ is bounded from below in $U \cap (2x_0 - U)$. Thus $h$ is bounded from below on compact subsets of the interior of $U$.

Proof. Let $y = 2x_0 - x$. Then $y \in U$ as $x \in (2x_0 - U)$. Also $x_0 = (x + y)/2$. Thus
\[ h(x_0) = h\left(\frac{x + y}{2}\right) \le \frac{h(x) + h(y)}{2} + 1 \le \frac{h(x) + C}{2} + 1. \]
Solving this for $h(x)$ completes the proof. □
The following theorem is one of our main results.

2.3. Theorem. Let $A \subset \mathbf{R}^n$ with convex hull $E = \mathrm{Co}(A)$. Let $h: E \to \mathbf{R}$ be an approximately convex function which is bounded above and which satisfies $h \le 0$ on $A$. Then
\[ \sup_{x \in E} h(x) \le \lfloor \log_2 n \rfloor + 1 + \frac{n}{2^{\lfloor \log_2 n \rfloor}}. \]
Moreover this is the sharp upper bound (the sharpness follows from Theorem 2.21).

2.4. Remark. The assumption that $h$ is bounded above cannot be dropped. For the relevant example see Example 2.42 in §2.7 below. □
Before giving the proof we give a name to the bounds in the Theorem and show that they satisfy a recursion which is a main ingredient of the proof. Let $\kappa(0) = 0$ and for $n \ge 1$
\[ \kappa(n) = \lfloor \log_2 n \rfloor + 1 + \frac{n}{2^{\lfloor \log_2 n \rfloor}}. \tag{2.1} \]
This notation will be used throughout the rest of the paper.
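For readers who want to experiment with these bounds, the following short Python sketch evaluates $\kappa(n)$ exactly from (2.1). The function name `kappa` and the use of `Fraction` are our own choices for illustration.

```python
from fractions import Fraction


def kappa(n):
    """kappa(0) = 0 and kappa(n) = floor(log2 n) + 1 + n / 2**floor(log2 n)."""
    if n == 0:
        return Fraction(0)
    m = n.bit_length() - 1  # floor(log2 n) for a positive integer n
    return m + 1 + Fraction(n, 2**m)


# The first few values: 0, 2, 3, 7/2, 4, 17/4, 9/2, 19/4, 5
print([str(kappa(n)) for n in range(9)])
```

Using `bit_length` avoids floating point logarithms, so the values are exact rationals and can be compared with `==`.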

2.5. Proposition. The sequence $(\kappa(n))_{n=0}^\infty$ satisfies the recursion
\[ \kappa(n) = \max_{\substack{n_1 + n_2 = n \\ n_1, n_2 \ge 0}} \left( \frac{\kappa(n_1) + \kappa(n_2)}{2} + 1 \right) \tag{2.2} \]
for $n \ge 1$.

2.6. Lemma. Let $(a(i))_{i=0}^m$ be a finite sequence on $\{0, 1, \dots, m\}$ so that $(a(j) - a(j-1))_{j=1}^m$ is monotone decreasing (that is the sequence is concave). Then
\[ \max_{i + j = m} \left( \frac{a(i) + a(j)}{2} + 1 \right) = \begin{cases} \dfrac{a(n) + a(n)}{2} + 1 = a(n) + 1, & m = 2n; \\[1ex] \dfrac{a(n) + a(n+1)}{2} + 1, & m = 2n + 1. \end{cases} \]

Proof. Let $\beta(i) = (a(i) + a(m - i))/2 + 1$. Then the concavity of $(a(i))$ implies the sequence $(\beta(i))$ is also concave. Also $\beta(i) = \beta(m - i)$ so $(\beta(i))$ is symmetric. But a symmetric concave function takes on its maximum at the center of its interval of definition. Thus if $m = 2n$ is even the maximum is $\beta(n) = a(n) + 1$ and if $m = 2n + 1$ the maximum is $\beta(n) = \beta(n + 1) = (a(n) + a(n + 1))/2 + 1$. □



Proof of Proposition 2.5. A calculation shows
\[ \kappa(2n) = \kappa(n) + 1, \qquad \kappa(2n + 1) = \frac{\kappa(n) + \kappa(n + 1)}{2} + 1. \]
(The second of these is most easily seen by writing $n = 2^m + r$ where $0 \le r \le 2^m - 1$.) But the sequence $(\kappa(n) - \kappa(n - 1))_{n=1}^\infty$ is monotone decreasing so that an application of the last lemma completes the proof. □
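The recursion (2.2) and the two halving identities used in its proof are easy to confirm numerically for small $n$. The sketch below is a sanity check rather than a proof; it assumes the maximum in (2.2) runs over $n_1, n_2 \ge 0$, and the helper names are hypothetical.

```python
from fractions import Fraction


def kappa(n):
    """kappa(n) from (2.1), computed exactly."""
    if n == 0:
        return Fraction(0)
    m = n.bit_length() - 1  # floor(log2 n)
    return m + 1 + Fraction(n, 2**m)


def recursion_rhs(n):
    """Right-hand side of (2.2): max over n1 + n2 = n with n1, n2 >= 0."""
    return max((kappa(n1) + kappa(n - n1)) / 2 + 1 for n1 in range(n + 1))


def halving_identities_hold(n):
    """kappa(2n) = kappa(n) + 1 and kappa(2n+1) = (kappa(n) + kappa(n+1))/2 + 1."""
    return (kappa(2 * n) == kappa(n) + 1
            and kappa(2 * n + 1) == (kappa(n) + kappa(n + 1)) / 2 + 1)


print(all(kappa(n) == recursion_rhs(n) for n in range(1, 200)))
print(all(halving_identities_hold(n) for n in range(1, 200)))
```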



Proof of Theorem 2.3. Recalling the definition of $\kappa(n)$ we wish to show that $\sup_{x \in E} h(x) \le \kappa(n)$. We use induction on $n$ based on the recursion (2.2) satisfied by $\kappa$. The base case of $n = 0$ is clear. Suppose $n \ge 1$ and assume that the assertion holds for all integers less than $n$. If $x \in E$ then by Carathéodory's Theorem (cf. [?, p. 3]) there are $x_0, \dots, x_n \in A$ so that $x \in \mathrm{Co}\{x_0, \dots, x_n\}$. Thus without loss of generality we may assume that $E = \mathrm{Co}\{x_0, \dots, x_n\}$. Let $M := \sup_{x \in E} h(x) < \infty$ and let $\varepsilon > 0$. Suppose that $x = \sum_{k=0}^n \alpha_k x_k \in E$ (with $\sum_{k=0}^n \alpha_k = 1$ and $\alpha_k \ge 0$) and $h(x) > M - \varepsilon$.
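By reordering the terms if necessary we may assume $\alpha_0 \le \alpha_1 \le \cdots \le \alpha_n$. Note that $\alpha_0 \le 1/(n + 1) \le 1/2$. Let $n_1$ be the least integer so that
\[ \sum_{k=0}^{n_1} \alpha_k > \frac12. \]
Then $\sum_{k=0}^{n_1 - 1} \alpha_k \le \frac12$. Set
\[ s = \frac12 - \sum_{k=0}^{n_1 - 1} \alpha_k, \qquad t = \alpha_{n_1} - s, \]
and let
\[ y = 2\left( \sum_{k=0}^{n_1 - 1} \alpha_k x_k + s\, x_{n_1} \right), \qquad z = 2\left( t\, x_{n_1} + \sum_{k=n_1 + 1}^{n} \alpha_k x_k \right). \]
Then $y \in E$ as $\sum_{k=0}^{n_1 - 1} \alpha_k + s = \frac12$. Likewise $z \in E$. In particular $y \in \mathrm{Co}\{x_0, \dots, x_{n_1}\} =: \Delta_1$ and $z \in \mathrm{Co}\{x_{n_1}, \dots, x_n\} =: \Delta_2$. Then $\dim \Delta_1 = n_1$ and $\dim \Delta_2 = n - n_1 =: n_2$. Since $\alpha_0 \le 1/2$ we have $n_1 \ge 1$.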

If $n_1 < n$ then $n_1, n_2 < n$ and therefore by the induction hypothesis and $x = \frac12(y + z)$, we have
\[ M - \varepsilon < h(x) \le \frac{h(y) + h(z)}{2} + 1 \le \frac{\kappa(n_1) + \kappa(n_2)}{2} + 1 \le \kappa(n_1 + n_2) = \kappa(n). \]
Therefore $M < \kappa(n) + \varepsilon$. This leaves the case $n_1 = n$. Then $z = x_n \in A$ and thus $h(z) \le 0$. Whence
\[ M - \varepsilon < h(x) \le \frac{h(y) + h(z)}{2} + 1 \le \frac{h(y)}{2} + 1 \le \frac{M}{2} + 1. \]
Solve this inequality for $M$ and use $2 \le \kappa(n)$ to get $M < 2(1 + \varepsilon) \le \kappa(n)(1 + \varepsilon)$. Combining the inequalities from the two cases and letting $\varepsilon \searrow 0$ implies $M \le \kappa(n)$ and completes the proof. □



2.7. Remark. As many of our results will involve $\kappa(n)$ it is worth giving some sharp bounds on $\kappa(n)$. To do this extend $\kappa$ to the positive reals by defining $\kappa(x) = \lfloor \log_2 x \rfloor + 1 + x/2^{\lfloor \log_2 x \rfloor}$. Then for any integer $m$ we have $\kappa(2^m) = m + 2 = \log_2(2^m) + 2$. On closed intervals $[2^m, 2^{m+1}]$ the function $\kappa(x)$ is linear. Thus $\kappa(x)$ is the continuous piecewise linear function on $(0, \infty)$ with knots at $x = 2^m$ and with $\kappa(x) = 2 + \log_2(x)$ at the knots. As the function $2 + \log_2(x)$ is concave this implies $\kappa(x) \le 2 + \log_2(x)$. On each of the intervals it is a straightforward calculus exercise to find the maximum of $(2 + \log_2(x)) - \kappa(x)$ on the interval $[2^m, 2^{m+1}]$. The result is $(\ln(2) - \ln(\ln(2)) - 1)/\ln(2) \approx .08607133206$ (surprisingly this is independent of which interval $[2^m, 2^{m+1}]$ we are working on). This leads to the bounds
\[ 1.913928 + \log_2(n) \le \kappa(n) \le 2 + \log_2(n). \]
□
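The constant in Remark 2.7 can be checked numerically. The following Python sketch is an illustration in floating point (so comparisons carry a small tolerance); the names `GAP` and `gaps` are ours, and the range of $n$ tested is an arbitrary choice.

```python
import math


def kappa(n):
    """kappa(n) for an integer n >= 1, as in (2.1), in floating point."""
    m = n.bit_length() - 1  # floor(log2 n)
    return m + 1 + n / 2**m


# Claimed maximum of (2 + log2 x) - kappa(x) on each interval [2^m, 2^(m+1)].
GAP = (math.log(2) - math.log(math.log(2)) - 1) / math.log(2)


def gaps(limit):
    """The differences 2 + log2(n) - kappa(n) for n = 1, ..., limit."""
    return [2 + math.log2(n) - kappa(n) for n in range(1, limit + 1)]


g = gaps(4096)
print(GAP, min(g), max(g))
```

The smallest gap is attained at powers of two, where $\kappa(n) = 2 + \log_2 n$, and no integer gap exceeds `GAP`, in line with the two-sided bound above.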
2.2. Lower semi-continuity and mean value properties of extremal approximately convex functions. Let $K \subset \mathbf{R}^n$ be a compact convex set and let $V$ be the set of extreme points of $K$. Let $\varphi: V \to \mathbf{R}$ be a function. Then a function $h: K \to \mathbf{R}$ has extreme values equal to $\varphi$ iff $h|_V = \varphi$. (The terminology is a variant on that used in partial differential equations where the boundary values of a function are often prescribed.) Likewise if $f, g: K \to \mathbf{R}$ are two functions then $f$ and $g$ have the same extreme values iff they agree on $V$. If $\varphi: V \to \mathbf{R}$, let $B(K, \varphi)$ be the set of bounded approximately convex functions $h$ so that $h|_V \le \varphi$ on $V$. Then the extremal approximately convex function with extreme values equal to $\varphi$ is
\[ E_{K,\varphi}(x) = \sup_{h \in B(K,\varphi)} h(x). \tag{2.3} \]
This is the pointwise largest approximately convex function with extreme values $\le \varphi$ on $V$. While in general we may have $E_{K,\varphi}(v) < \varphi(v)$ for some $v \in V$, we will show that if $\varphi$ is uniformly continuous on $V$ (which will always be the case if $V$ is finite) then $E_{K,\varphi}|_V = \varphi$ and that $E_{K,\varphi}$ is lower semi-continuous on $K$.

Let $K \subset \mathbf{R}^n$ be a compact convex set with extreme points $V$. Then for any function $h: K \to \mathbf{R}$ which is bounded above define $Sh$ by
\[ Sh(x) := \begin{cases} h(x), & x \in V; \\[1ex] \inf\left\{ \dfrac{h(y) + h(z)}{2} + 1 : \dfrac{y + z}{2} = x \right\}, & x \in K \setminus V. \end{cases} \]
This operator is closely related to approximately convex functions as
\[ f \le Sf \iff f \text{ is approximately convex on } K. \tag{2.4} \]
Despite being nonlinear $S$ is somewhat like a mean value operator. We make this more precise by proving a maximum principle for the equation $Sf = f$.

2.8. Theorem. Let $K \subset \mathbf{R}^n$ be a compact convex set with extreme points $V$. Let $f, F: K \to \mathbf{R}$ be bounded functions so that $Sf \le f$ and $F$ is approximately convex (that is $SF \ge F$). Let
\[ L(x) = \min\{ f(x), \liminf_{y \to x} f(y) \} \tag{2.5} \]
be the lower semi-continuous envelope of $f$. Then
\[ \sup_{x \in K} (F(x) - f(x)) = \sup_{v \in V} (F(v) - f(v)) \tag{2.6} \]
and
\[ \sup_{x \in K} (F(x) - L(x)) = \sup_{v \in V} (F(v) - L(v)). \tag{2.7} \]


Proof. We will prove (2.7), the proof of (2.6) being similar (and a little easier). The inequality $Sf \le f$ implies that if $x \notin V$ then
\[ f(x) \ge \inf\left\{ \frac{f(y) + f(z)}{2} + 1 : \frac{y + z}{2} = x \right\}. \tag{2.8} \]
As $f$ and $F$ are bounded we may assume (after possibly adding positive constants to $f$ and $F$) that $0 \le f \le F \le M$ for some positive constant $M$. This implies $0 \le L \le F$. Set $\omega(x) := F(x) - L(x)$ and $\delta := \sup_{x \in K} \omega(x)$. Then we wish to show $\sup_{v \in V} \omega(v) = \delta$. If $\delta = 0$ then $L = F$ and there is nothing to prove. So assume $\delta > 0$. Choose a positive integer $N$ so that $N > M$. Let $0 < \varepsilon < 1$ and choose $w_0$ to be a point so that $\omega(w_0) > (1 - \varepsilon 2^{-N})\delta$. Suppose that $w_0 \notin V$ for sufficiently small $\varepsilon > 0$ (otherwise the desired conclusion follows as $\varepsilon \to 0$). From the definition of $L$ there is a sequence $(x_k)_{k=1}^\infty$ such that $x_k \to w_0$ and $f(x_k) \to L(w_0)$. By equation (2.8) there are sequences $(y_k)_{k=1}^\infty$ and $(z_k)_{k=1}^\infty$ such that $x_k = (y_k + z_k)/2$ and a real number $C \ge 0$ such that
\[ f(x_k) - \left( \frac{f(y_k) + f(z_k)}{2} + 1 \right) \to C \ge 0. \tag{2.9} \]
By passing to a subsequence we may assume that $y_k \to y$, $z_k \to z$, $f(y_k) \to A$, and $f(z_k) \to B$ for some $y, z \in K$ and $A, B \in \mathbf{R}$. Clearly $w_0 = (y + z)/2$ and (using the definition of $L$) $L(y) \le A$, $L(z) \le B$. Then (2.9) yields
\[ L(w_0) = \frac{A + B}{2} + 1 + C \ge \frac{L(y) + L(z)}{2} + 1 \tag{2.10} \]
and so
\[ F(w_0) = L(w_0) + \omega(w_0) \ge \frac{L(y) + L(z)}{2} + 1 + \omega(w_0). \tag{2.11} \]
But since $F$ is approximately convex
\[ F(w_0) \le \frac{F(y) + F(z)}{2} + 1 = \frac{L(y) + L(z)}{2} + 1 + \frac{\omega(y) + \omega(z)}{2}. \tag{2.12} \]
Combining (2.12) and (2.11) yields
\[ \frac{\omega(y) + \omega(z)}{2} \ge \omega(w_0). \tag{2.13} \]
Since $\delta = \sup_{x \in K} \omega(x)$ and $\omega(w_0) > (1 - \varepsilon 2^{-N})\delta$, (2.13) implies
\[ \min\{\omega(y), \omega(z)\} \ge 2\omega(w_0) - \delta > (2(1 - \varepsilon 2^{-N}) - 1)\delta = (1 - \varepsilon 2^{-(N-1)})\delta. \]
From (2.10) we have $\min\{L(y), L(z)\} \le L(w_0) - 1$. Without loss of generality we may assume that $L(y) \le L(w_0) - 1$. Let $w_1 = y$. Then $L(w_1) \le L(w_0) - 1$ and $\omega(w_1) > (1 - \varepsilon 2^{-(N-1)})\delta$.

If $w_1 \notin V$ then we can repeat this argument (with $N$ replaced by $N - 1$) and get a $w_2 \in K$ with $L(w_2) \le L(w_1) - 1$ and $\omega(w_2) > (1 - \varepsilon 2^{-(N-2)})\delta$. We continue in this manner to get a finite sequence $w_0, w_1, \dots, w_m$ with $m \le N$ so that for $1 \le k \le m - 1$ we have $L(w_k) \le L(w_{k-1}) - 1$, $\omega(w_k) > (1 - \varepsilon 2^{-(N-k)})\delta$, and $w_k \notin V$. (Note this cannot continue for $k > N$ as that would imply $L(w_N) \le L(w_0) - N \le M - N < 0$, contradicting $L \ge 0$. Thus $w_m \in V$ for some $m \le N$.) At the last step $w_m \in V$ and $\omega(w_m) > (1 - \varepsilon 2^{-(N-m)})\delta$. Therefore $\sup_{v \in V} \omega(v) > (1 - \varepsilon 2^{-(N-m)})\delta$. Letting $\varepsilon \searrow 0$ yields $\sup_{v \in V} \omega(v) \ge \delta$. But $\sup_{v \in V} \omega(v) \le \delta$ is clear. Thus $\sup_{v \in V} \omega(v) = \delta$ as required. □
2.9. Proposition. Let $K \subset \mathbf{R}^n$ be a compact convex set with extreme points $V$ and let $h: K \to \mathbf{R}$ be a bounded approximately convex function on $K$. Then $h(x) \le Sh(x)$, the functions $h$ and $Sh$ have the same extreme values, $Sh$ is approximately convex, and if $h$ is lower semi-continuous as a function on $K$ at points of $V$ then the same is true of $Sh$.

Proof. If $x = (y + z)/2$ then as $h$ is approximately convex $h(x) \le (h(y) + h(z))/2 + 1$ and taking the infimum yields $h(x) \le Sh(x)$. That $h$ and $Sh$ have the same extreme values is clear. Using the definition of $Sh$ and the inequality $h \le Sh$ we have
\[ Sh\left(\frac{y + z}{2}\right) \le \frac{h(y) + h(z)}{2} + 1 \le \frac{Sh(y) + Sh(z)}{2} + 1, \]
which shows $Sh$ is approximately convex. Finally if $h$ is lower semi-continuous at points of $V$ then for $x \in V$ we have $\liminf_{y \to x} Sh(y) \ge \liminf_{y \to x} h(y) \ge h(x) = Sh(x)$. This shows $Sh$ is lower semi-continuous at $x$ and completes the proof. □

We now characterize the extremal functions $E_{K,\varphi}$ as the unique bounded solutions to the equation $Sf = f$ with extreme values $\varphi$.

2.10. Theorem. Let $K$ be a convex set with extreme points $V$ and $f: K \to \mathbf{R}$ a bounded function so that $Sf = f$. Let $\varphi := f|_V$ be the extreme values of $f$ and let $E_{K,\varphi}$ be the extremal approximately convex function with extreme values $\varphi$. Then $f = E_{K,\varphi}$.

Proof. The equality $Sf = f$ implies $f$ is approximately convex (cf. (2.4)). Then the extremal property of $E_{K,\varphi}$ implies $f \le E_{K,\varphi}$. Letting $F = E_{K,\varphi}$ in Theorem 2.8 and using that $f$ and $E_{K,\varphi}$ agree on $V$, we can use equation (2.6) to conclude $f = E_{K,\varphi}$. □

The following is an elementary variant on Corollary 17.2.1 in [?]. We include a short proof for completeness.

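2.11. Proposition. Assume $K \subset \mathbf{R}^n$ is a compact convex set and $V$ the set of extreme points of $K$. Let $\varphi: V \to \mathbf{R}$ be uniformly continuous. Then there exists a lower semi-continuous convex function $h: K \to \mathbf{R}$ so that $h|_V = \varphi$. Moreover we can choose $h$ so that $\inf_{x \in K} h(x) = \inf_{v \in V} \varphi(v)$ and $\sup_{x \in K} h(x) = \sup_{v \in V} \varphi(v)$.

Proof. Let $\overline{V}$ be the closure of $V$. As $\varphi: V \to \mathbf{R}$ is uniformly continuous it has a unique continuous extension $\overline{\varphi}: \overline{V} \to \mathbf{R}$. Let $G_{\overline{\varphi}} := \{(x, \overline{\varphi}(x)) : x \in \overline{V}\} \subset K \times \mathbf{R}$ be the graph of $\overline{\varphi}$. As the set $\overline{V}$ is compact and $\overline{\varphi}$ is continuous the set $G_{\overline{\varphi}}$ is also compact. Therefore the convex hull $\mathrm{Co}(G_{\overline{\varphi}})$ is compact. Let $A := \inf_{v \in V} \varphi(v) = \min_{x \in \overline{V}} \overline{\varphi}(x)$ and $B := \sup_{v \in V} \varphi(v) = \max_{x \in \overline{V}} \overline{\varphi}(x)$. Then $\mathrm{Co}(G_{\overline{\varphi}}) \subset K \times [A, B]$. Moreover, as $K$ is the convex hull of its set of extreme points $V$, if $x \in K$ then there is $y \in [A, B]$ so that $(x, y) \in \mathrm{Co}(G_{\overline{\varphi}})$. Define $h$ by
\[ h(x) := \min\{ y : (x, y) \in \mathrm{Co}(G_{\overline{\varphi}}) \}. \]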
It is clear from this definition that $h$ is convex and has the same supremum and infimum as $\varphi$. We now show that $h$ is lower semi-continuous. Let $a \in K$ and let $\Lambda := \liminf_{x \to a} h(x)$. Choose a sequence $(x_\ell)_{\ell=1}^\infty$ so that $x_\ell \to a$ and $h(x_\ell) \to \Lambda$. Then as $\mathrm{Co}(G_{\overline{\varphi}})$ is compact (and thus closed) the limit $\lim_{\ell \to \infty} (x_\ell, h(x_\ell)) = (a, \Lambda) \in \mathrm{Co}(G_{\overline{\varphi}})$. The definition of $h$ then implies $h(a) \le \Lambda = \liminf_{x \to a} h(x)$. Thus $h$ is lower semi-continuous at $a$ for every $a \in K$.

Finally let $v \in V$. Then as $(v, h(v)) \in \mathrm{Co}(G_{\overline{\varphi}})$ there exist $(\alpha_0, \dots, \alpha_{n+1}) \in \Delta_{n+1}$ and $v_0, \dots, v_{n+1} \in \overline{V}$ so that $(v, h(v)) = \sum_{k=0}^{n+1} \alpha_k (v_k, \overline{\varphi}(v_k))$. But $v$ is an extreme point of $K$, which implies that $v_k = v$ for all $k$ (with $\alpha_k > 0$) and therefore $h(v) = \overline{\varphi}(v) = \varphi(v)$. □



2.12. Theorem. Let $K \subset \mathbf{R}^n$ be a compact convex set with extreme points $V$. Assume that $\varphi: V \to \mathbf{R}$ is uniformly continuous. Then the extremal approximately convex function $E_{K,\varphi}$ satisfies $E_{K,\varphi}|_V = \varphi$ and is lower semi-continuous on $K$.

Proof. By Proposition 2.11 there exists a lower semi-continuous convex function $h: K \to \mathbf{R}$ with extreme values $\varphi$. As $h$ is convex it is a fortiori approximately convex (so that $h \le E_{K,\varphi}$) and we have for $v \in V$ that $\varphi(v) = h(v) \le E_{K,\varphi}(v) \le \varphi(v)$, and so $E_{K,\varphi}$ has $\varphi$ as extreme values. As $h \le E_{K,\varphi}$ and $h$ is lower semi-continuous, the function $E_{K,\varphi}$ will be lower semi-continuous at all points $x$ where $E_{K,\varphi}(x) = h(x)$. In particular, $E_{K,\varphi}$ will be lower semi-continuous at all points of $V$. Finally as $SE_{K,\varphi} \ge E_{K,\varphi}$ (cf. Proposition 2.9) the extremal property of $E_{K,\varphi}$ implies $SE_{K,\varphi} = E_{K,\varphi}$. Now in Theorem 2.8 let $f = F = E_{K,\varphi}$ and let $L$ be the lower semi-continuous envelope of $f = E_{K,\varphi}$ as given by (2.5). Then as $E_{K,\varphi}$ is lower semi-continuous at points of $V$ we have that $E_{K,\varphi}(v) = L(v)$ for all $v \in V$. Therefore (2.7) implies that $E_{K,\varphi} = L$ on $K$, so that $E_{K,\varphi}$ is lower semi-continuous as claimed. □



2.13. Remark. Let $K \subset \mathbf{R}^n$ be a convex set with extreme points $V$. Let $h: K \to \mathbf{R}$ be a bounded approximately convex function and let $\varphi: V \to \mathbf{R}$ be the extreme values of $h$, that is $\varphi := h|_V$. Then there is a bounded function $f: K \to \mathbf{R}$ such that $f|_V = \varphi$ for which the inequality $Sf \le f$ holds pointwise on $K$. (Such a function exists as is seen by letting $f = E_{K,\varphi}$. On the simplex $\Delta_n$ with $\varphi = 0$ the function $f(x) = 2k$ for $x$ in the interior of a $k$-dimensional face is an example of such a function.) Then define two sequences $(h_k)_{k=0}^\infty$ and $(f_k)_{k=0}^\infty$ of functions on $K$ by
\[ h_0 = h, \quad h_{k+1} = Sh_k, \qquad f_0 = f, \quad f_{k+1} = Sf_k. \]
Then it can be shown that $f_{k+1} \le f_k$, $h_{k+1} \ge h_k$ and that each $h_k$ is approximately convex. (The statements about $h_k$ follow from Proposition 2.9.) Also all the $h_k$'s and $f_k$'s have $\varphi$ as extreme values. Therefore both sequences have pointwise limits $h_\infty = \lim_{k \to \infty} h_k$ and $f_\infty = \lim_{k \to \infty} f_k$. These both have $\varphi$ as extreme values, $Sh_\infty = h_\infty$, and $Sf_\infty = f_\infty$. Therefore by Theorem 2.10 we have $h_\infty = f_\infty = E_{K,\varphi}$. This gives a method for finding $E_{K,\varphi}$ as the limit of two more or less constructively defined sequences. Also note that for each $k$ we have the inequalities
\[ h_k \le E_{K,\varphi} \le f_k. \]
Thus we have explicit upper and lower bounds for $E_{K,\varphi}$. □

2.3. The extremal approximately sub-affine function H(x). A function $f: [0, 1] \to \mathbf{R}$ is approximately sub-affine iff
\[ f\left(\frac{x + y}{2}\right) \le \frac{f(x) + f(y)}{2} + \frac{x + y}{2}. \tag{2.14} \]
As in a later example, approximately sub-affine functions can be used to construct approximately convex functions on a simplex. As a first step in explicitly describing the extremal approximately convex function on a simplex we describe the extremal approximately convex function on the unit interval.

Let $\mathbf{N} = \{0, 1, 2, \dots\}$ be the natural numbers and let $\mathcal{D}$ be the dyadic rational numbers in $[0, 1]$. That is
\[ \mathcal{D} := \left\{ \frac{m}{2^n} : m, n \in \mathbf{N} \text{ and } 0 \le m \le 2^n \right\}. \]
(These play a considerable role in what follows.) The numbers in $[0, 1] \setminus \mathcal{D}$ will be called the dyadic irrationals. Every dyadic irrational $x$ has a unique binary expansion $x = \sum_{i=0}^\infty x_i/2^i$ with $x_i \in \{0, 1\}$. If $x \in \mathcal{D}$ then there are two binary expansions: the finite expansion $x = \sum_{i=0}^N x_i/2^i$ and, if $x_N = 1$, there is also the infinite expansion $x = \sum_{i=0}^{N-1} x_i/2^i + \sum_{i=N+1}^\infty 1/2^i$. Unless stated otherwise we will always use the finite expansion for an element of $\mathcal{D}$, even when we write $x = \sum_{i=0}^\infty x_i/2^i$ for notational uniformity. With this understood, define $H: [0, 1] \to \mathbf{R}$ by
\[ H(x) := \sum_{i=0}^\infty \frac{i\, x_i}{2^i} \quad\text{where}\quad x = \sum_{i=0}^\infty \frac{x_i}{2^i}. \tag{2.15} \]
For motivation see Remark 2.20. A graph of $H$ is shown in Figure 1.
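On dyadic rationals the definition (2.15) is a finite sum and can be evaluated exactly. The sketch below is a small illustration; the function name `H_dyadic` and the encoding of $x = m/2^n$ by the integer pair `(m, n)` are our own conventions.

```python
from fractions import Fraction


def H_dyadic(m, n):
    """H(m / 2**n) from (2.15), using the finite binary expansion of m / 2**n."""
    if not 0 <= m <= 2**n:
        raise ValueError("need 0 <= m <= 2**n")
    total = Fraction(0)
    for i in range(n + 1):
        digit = (m >> (n - i)) & 1  # x_i, the coefficient of 1 / 2**i
        total += Fraction(i * digit, 2**i)
    return total


# H(0) = H(1) = 0, H(1/2) = 1/2, H(1/4) = 1/2, H(3/4) = 1
print(H_dyadic(0, 2), H_dyadic(4, 2), H_dyadic(2, 2), H_dyadic(1, 2), H_dyadic(3, 2))
```

Note that $x = 1$ has the single digit $x_0 = 1$, which carries weight $i = 0$, so $H(1) = 0$.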

We now derive another representation of $H$. Let $r: \mathbf{R} \to \mathbf{R}$ be defined by
\[ r(x) := \begin{cases} 0, & 0 \le x < 1; \\ 1, & 1 \le x < 2, \end{cases} \]
and extend to $\mathbf{R}$ by periodicity: $r(x + 2) = r(x)$. If $0 \le x < 1$ and $x$ has binary expansion $x = \sum_{i=1}^\infty x_i/2^i$ where $x_i \in \{0, 1\}$, then it is not hard to see that $x_i = r(2^i x)$ (if $x$ is a dyadic rational we check to see this does give the finite expansion). It follows for $0 \le x < 1$ that $x = \sum_{i=1}^\infty r(2^i x)/2^i$.
More generally if we let $\{x\} = x - \lfloor x \rfloor$ be the fractional part of $x$ then as both $\{x\}$ and $\sum_{i=1}^\infty r(2^i x)/2^i$ are periodic with period 1 and $\{x\} = x$ for $0 \le x < 1$ we have
\[ \{x\} = \sum_{i=1}^\infty \frac{r(2^i x)}{2^i}. \tag{2.16} \]
If $H$ is extended to $\mathbf{R}$ to be periodic, $H(x + 1) = H(x)$, (this is possible as $H(0) = H(1) = 0$) then the definition of $H$ becomes
\[ H(x) = \sum_{i=1}^\infty \frac{i\, r(2^i x)}{2^i}. \tag{2.17} \]



Figure 1. Graphs of $y = H(x)$, $y = x\log_2(x)$, and $y = 2x + \log_2(x)$ for $0 < x < 1$.



2.14. Proposition. Let the function $H$ be extended from $[0, 1)$ to $\mathbf{R}$ so that $H$ is periodic: $H(x + 1) = H(x)$. Then $H$ satisfies the functional equation
\[ H(x) = \{x\} + \frac12 H(2x) \tag{2.18} \]
and thus $H$ has the series representation
\[ H(x) = \sum_{k=0}^\infty \frac{\{2^k x\}}{2^k}. \tag{2.19} \]
This implies $H$ is lower semi-continuous, continuous at all points of $[0, 1] \setminus \mathcal{D}$ and right continuous at all points.

Proof. This is a calculation based on the two series (2.17) and (2.16):
\[ H(x) = \sum_{i=1}^\infty \frac{i\, r(2^i x)}{2^i} = \sum_{i=1}^\infty \frac{r(2^i x)}{2^i} + \sum_{i=2}^\infty \frac{(i - 1)\, r(2^i x)}{2^i} = \{x\} + \frac12 \sum_{j=1}^\infty \frac{j\, r(2^j \cdot 2x)}{2^j} = \{x\} + \frac12 H(2x). \]
To prove the series representation (2.19) for $H(x)$ observe that an induction using the functional equation (2.18) yields
\[ H(x) = \sum_{k=0}^m \frac{\{2^k x\}}{2^k} + \frac{1}{2^{m+1}} H(2^{m+1} x) \]
and as $0 \le H(x) \le \sum_{i=1}^\infty i/2^i = 2$ the series converges uniformly to $H(x)$. The functions $x \mapsto \{2^k x\}/2^k$ are lower semi-continuous and right continuous and hence so are the partial sums $S_n(x) = \sum_{k=0}^n \{2^k x\}/2^k$. Thus $H$ is the uniform limit of lower semi-continuous and right continuous functions and therefore is lower semi-continuous and right continuous. Finally the functions $\{2^k x\}/2^k$ are continuous at all points of $[0, 1] \setminus \mathcal{D}$. As the series converges uniformly this implies that the sum $H$ is also continuous at these points. □
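For dyadic rationals the series (2.19) terminates, since $\{2^k x\} = 0$ once $2^k x$ is an integer, so the functional equation (2.18) can be checked exactly. The following Python sketch does this on a grid; the helper names `frac` and `H` and the grid spacing $1/64$ are illustrative assumptions.

```python
from fractions import Fraction


def frac(x):
    """Fractional part {x} = x - floor(x) of a Fraction."""
    return x - (x.numerator // x.denominator)


def H(x):
    """H(x) via the series (2.19) for a dyadic rational x (periodic in x)."""
    d = x.denominator
    if d & (d - 1) != 0:
        raise ValueError("x must be a dyadic rational")
    total, k = Fraction(0), 0
    # Once 2**k * x is an integer, every later term of the series vanishes.
    while frac(2**k * x) != 0:
        total += frac(2**k * x) / 2**k
        k += 1
    return total


grid = [Fraction(m, 64) for m in range(65)]
functional_equation_ok = all(H(x) == frac(x) + H(2 * x) / 2 for x in grid)
print(functional_equation_ok, H(Fraction(3, 4)))
```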

2.15. Remark. The graph of $H(x)$ has an interesting "self-congruence" property. The series (2.19) for $H(x)$ implies for $m$ a positive integer that
\[ H\left(x + \frac{1}{2^m}\right) = \sum_{k=0}^{m-1} \frac{1}{2^k} \left( \{2^k x + 2^{k-m}\} - \{2^k x\} \right) + H(x) = P_m(x) + H(x) \]
where this defines $P_m(x)$. It is not hard to check that the functions $(\{2^k x + 2^{k-m}\} - \{2^k x\})/2^k$ are all constant on intervals $[i/2^m, (i+1)/2^m)$ and so the same will be true for $P_m(x)$. This implies for any $i$ and $j$ that the graph of the restriction $H|_{[i/2^m, (i+1)/2^m)}$ is a translation of the graph of $H|_{[j/2^m, (j+1)/2^m)}$. So informally and somewhat imprecisely "the graph of $H$ is locally self congruent at all the scales $1/2^m$". If $\Gamma$ is the closure of the graph of $H|_{[0,1]}$ then this, and some calculation, can be used to show $\Gamma$ can be covered by $2^m$ closed sets of diameter $\le 4m2^{-m}$. Thus for any $\delta > 0$ the Hausdorff $\delta$-dimensional measure of $\Gamma$ is $\le 2^m (4m2^{-m})^\delta$ and when $\delta > 1$ we have $2^m (4m2^{-m})^\delta \to 0$ as $m \to \infty$. Therefore the Hausdorff dimension of $\Gamma$ is $\le 1$. But as $\Gamma$ projects onto the interval $[0, 1]$ its Hausdorff dimension is $\ge 1$. Thus $\Gamma$ has Hausdorff dimension one. (With a little more work it can be shown the one dimensional Hausdorff measure of $\Gamma$ is infinite.) However $\Gamma$ is compact, separable, totally disconnected and has no isolated points. Thus $\Gamma$ is homeomorphic to the Cantor set and therefore of topological dimension zero. Whence the closure of the graph of $H$ is a "fractal" in the sense that its geometric dimension is greater than its topological dimension. □

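2.16. Proposition. The function $H$ is approximately sub-affine:
\[ H\left(\frac{x + y}{2}\right) \le \frac{H(x) + H(y)}{2} + \frac{x + y}{2}. \]

2.17. Lemma. If $x = \sum_{i=0}^N l_i/2^i \in [0, 1]$ with each $l_i$ a nonnegative integer, then
\[ H(x) \le \sum_{i=0}^N \frac{i\, l_i}{2^i} \]
with equality if and only if each $l_i \in \{0, 1\}$.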

Proof. If $(l_0, l_1, \dots, l_N)$ is a finite sequence with $l_i \ge 0$ we let $\Lambda(l_0, \dots, l_N) := \sum_{i=0}^N l_i$. The proof is by induction on $m = \Lambda(l_0, \dots, l_N)$. If $m = 0$ then each $l_i = 0$ and $x = H(x) = 0$ and the result is trivial. Now assume the inequality holds for all $(l_0, l_1, \dots, l_N)$ with $\Lambda(l_0, \dots, l_N) < m$. Let $k$ be the least integer such that $l_k \ge 2$ (if all $l_k \in \{0, 1\}$ there is nothing to prove). Note $k \ne 0$ as $2/2^0 = 2$. Then
\[ x = \sum_{i=0}^{k-2} \frac{l_i}{2^i} + \frac{l_{k-1}}{2^{k-1}} + \frac{l_k}{2^k} + \sum_{i=k+1}^N \frac{l_i}{2^i} = \sum_{i=0}^{k-2} \frac{l_i}{2^i} + \frac{l_{k-1} + 1}{2^{k-1}} + \frac{l_k - 2}{2^k} + \sum_{i=k+1}^N \frac{l_i}{2^i} = \sum_{i=0}^N \frac{r_i}{2^i} \]
where the last line defines the $r_i$ implicitly. Then
\[ \Lambda(r_0, \dots, r_N) = \Lambda(l_0, \dots, l_{k-1} + 1, l_k - 2, \dots, l_N) = \Lambda(l_0, \dots, l_N) - 1 = m - 1. \]
Thus the induction hypothesis gives
\[ \sum_{i=0}^N \frac{i\, l_i}{2^i} = \sum_{i=0}^N \frac{i\, r_i}{2^i} + \frac{2k}{2^k} - \frac{k - 1}{2^{k-1}} = \sum_{i=0}^N \frac{i\, r_i}{2^i} + \frac{k - (k - 1)}{2^{k-1}} > \sum_{i=0}^N \frac{i\, r_i}{2^i} \ge H(x). \]
This gives $H(x) < \sum_{i=0}^N i\, l_i/2^i$ unless $l_i \in \{0, 1\}$ for all $i$. This completes the proof. □
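Lemma 2.17 can be tested by brute force over short digit sequences. The sketch below is an illustrative check only: it evaluates $H$ through the series (2.19), and the bounds `N=4` and `max_digit=3` on the sequences tried are arbitrary assumptions.

```python
from fractions import Fraction
from itertools import product


def frac(x):
    """Fractional part of a Fraction."""
    return x - (x.numerator // x.denominator)


def H(x):
    """H(x) for a dyadic rational x, via the terminating series (2.19)."""
    total, k = Fraction(0), 0
    while frac(2**k * x) != 0:
        total += frac(2**k * x) / 2**k
        k += 1
    return total


def lemma_holds(N=4, max_digit=3):
    """Check H(x) <= sum i*l_i/2^i, with equality iff all l_i are 0 or 1."""
    for l in product(range(max_digit + 1), repeat=N + 1):
        x = sum(Fraction(li, 2**i) for i, li in enumerate(l))
        if x > 1:
            continue  # the lemma only concerns x in [0, 1]
        bound = sum(Fraction(i * li, 2**i) for i, li in enumerate(l))
        binary = all(li <= 1 for li in l)
        if H(x) > bound or (H(x) == bound) != binary:
            return False
    return True


print(lemma_holds())
```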
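Proof of Proposition 2.16. First consider the case $x, y \in \mathcal{D}$ so that $x = \sum_{i=0}^N x_i/2^i$, $y = \sum_{i=0}^N y_i/2^i$. Then by Lemma 2.17
\[ H\left(\frac{x + y}{2}\right) = H\left( \sum_{i=0}^N \frac{x_i + y_i}{2^{i+1}} \right) \le \sum_{i=0}^N \frac{(i + 1)(x_i + y_i)}{2^{i+1}} = \frac12 \left( \sum_{i=0}^N \frac{i\, x_i}{2^i} + \sum_{i=0}^N \frac{i\, y_i}{2^i} \right) + \frac12 \left( \sum_{i=0}^N \frac{x_i}{2^i} + \sum_{i=0}^N \frac{y_i}{2^i} \right) = \frac{H(x) + H(y)}{2} + \frac{x + y}{2}. \]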

If $x$ is a dyadic irrational and $y \in \mathcal{D}$ then we use that by Proposition 2.14 the function $H$ is lower semi-continuous on $\mathbf{R}$ and continuous at $x$. Let $x(r) \in \mathcal{D}$ so that $\lim_{r \to \infty} x(r) = x$ and so by continuity $\lim_{r \to \infty} H(x(r)) = H(x)$. Thus
\[ H\left(\frac{x + y}{2}\right) \le \liminf_{r \to \infty} H\left(\frac{x(r) + y}{2}\right) \le \lim_{r \to \infty} \left( \frac{H(x(r)) + H(y)}{2} + \frac{x(r) + y}{2} \right) = \frac{H(x) + H(y)}{2} + \frac{x + y}{2}. \]
The case where both $x$ and $y$ are dyadic irrationals is handled similarly. □
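The dyadic case of Proposition 2.16 can be confirmed exactly on a grid. In the Python sketch below, `subaffine_defect` (a name of our own) is the right side of the inequality minus the left side, so the proposition says it is never negative; the grid spacing $1/32$ is an arbitrary choice.

```python
from fractions import Fraction


def frac(x):
    """Fractional part of a Fraction."""
    return x - (x.numerator // x.denominator)


def H(x):
    """H(x) for a dyadic rational x, via the terminating series (2.19)."""
    total, k = Fraction(0), 0
    while frac(2**k * x) != 0:
        total += frac(2**k * x) / 2**k
        k += 1
    return total


def subaffine_defect(x, y):
    """(H(x) + H(y))/2 + (x + y)/2 - H((x + y)/2); nonnegative by Prop. 2.16."""
    return (H(x) + H(y)) / 2 + (x + y) / 2 - H((x + y) / 2)


grid = [Fraction(m, 32) for m in range(33)]
defects = [subaffine_defect(x, y) for x in grid for y in grid]
print(min(defects), max(defects))
```

The minimum defect on the grid is zero, so the inequality cannot be improved by a constant.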



2.18. Proposition. Suppose $f$ is a lower semi-continuous approximately sub-affine function defined on $[0,1]$ such that $f(0) = 0$. Then $f(x) \le H(x) + f(1)x$ for all $x \in [0,1]$.

First some preliminaries. If $x = \sum_{j=0}^{\infty} x_j/2^j \in \mathcal{D}$ define the dyadic support of $x$ to be $\{j \in \mathbb{N} : x_j = 1\}$ and denote it by $\operatorname{supp} x$.

2.19. Lemma. If $x, y \in \mathcal{D}$ and $(\operatorname{supp} x) \cap (\operatorname{supp} y) = \emptyset$ then

$$H\left(\frac{x+y}{2}\right) = \frac{H(x)+H(y)}{2} + \frac{x+y}{2}.$$



2.20. Remark. This lemma motivated the definition of $H$. As the proof of Proposition 2.18 makes clear this is the property which implies $H$ is the largest lower semi-continuous approximately sub-affine function on $[0,1]$. It also allows one to compute the values of $H$ on $\mathcal{D}$ leading to the formula (2.15). □



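Proof. The condition on the dyadic supports implies that the binary expansion of $x + y$ can be computed by just adding the digits without "carrying". Thus for sufficiently large $N$

$$H\left(\frac{x+y}{2}\right) = \sum_{j=0}^{N} \frac{(j+1)(x_j+y_j)}{2^{j+1}}
= \frac12\left(\sum_{j=0}^{N} \frac{j\,x_j}{2^j} + \sum_{j=0}^{N} \frac{j\,y_j}{2^j}\right) + \frac12\left(\sum_{j=0}^{N} \frac{x_j}{2^j} + \sum_{j=0}^{N} \frac{y_j}{2^j}\right)
= \frac{H(x)+H(y)}{2} + \frac{x+y}{2}.$$ □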



Proof of Proposition 2.18. If $f(x)$ is replaced by $\varphi(x) := f(x) - f(1)x$ then $\varphi$ will also be approximately sub-affine and $\varphi(0) = \varphi(1) = 0 = H(0) = H(1)$. We now show by induction on $k$ that if $x = m/2^k \in \mathcal{D}$ then $\varphi(x) \le H(x)$. The base case of $k = 0$ holds. Now assume that $x = m/2^k$ and that the result is true when the denominator of the fraction is a smaller power of 2. We may assume that $m$ is odd. If $x < 1/2$ let $y = 2x = m/2^{k-1}$. Then $x = (0+y)/2$, $\varphi(y) \le H(y)$ and $\operatorname{supp}(0) \cap \operatorname{supp}(y) = \emptyset$. Therefore

$$\varphi(x) = \varphi\left(\frac{0+y}{2}\right) \le \frac{\varphi(0)+\varphi(y)}{2} + \frac{0+y}{2}
\le \frac{H(0)+H(y)}{2} + \frac{0+y}{2} = H\left(\frac{0+y}{2}\right) = H(x).$$

If $1/2 < x < 1$ then let $y = 2x - 1$ so that $x = (y+1)/2$. Then as the dyadic supports of $y$ and $1$ are disjoint, a calculation like the one just done shows $\varphi(x) \le H(x)$. Thus $\varphi(x) \le H(x)$ for all $x \in \mathcal{D}$. For any other $x \in [0,1] \setminus \mathcal{D}$ choose $x_k \in \mathcal{D}$ with $x_k \to x$. By Proposition 2.14 $H$ is continuous at $x$. Therefore the lower semi-continuity of $\varphi$ implies

$$\varphi(x) \le \liminf_{k\to\infty} \varphi(x_k) \le \lim_{k\to\infty} H(x_k) = H(x).$$

Finally $\varphi(x) \le H(x)$ is equivalent to the required inequality for $f$. □

2.21. Proposition. The inequalities

$$x\log_2(1/x) \le H(x) \le 2x + x\log_2(1/x) \tag{2.20}$$

hold for $0 \le x \le 1$ (cf. the accompanying figure).

2.22. Lemma. Let $\varphi(x) := x\log_2(1/x) = -x\ln(x)/\ln(2)$. Then for $0 \le t \le 1$ and $x, y \in [0,1]$

$$0 \le \varphi((1-t)x + ty) - (1-t)\varphi(x) - t\varphi(y) \le \varphi(t)\,y + \varphi(1-t)\,x.$$

As $\varphi(1/2) = 1/2$ this implies $\varphi$ is approximately sub-affine on $[0,1]$.

Proof. The left hand inequality follows from the concavity of $\varphi$. To prove the right hand inequality we first assume $0 \le x \le y \le 1$. For fixed $t$ and $y$ let

$$F(x) := \varphi((1-t)x + ty) - (1-t)\varphi(x) - t\varphi(y).$$

Then

$$F'(x) = \frac{1-t}{\ln(2)}\bigl(\ln(x) - \ln((1-t)x + ty)\bigr) \le 0.$$

Therefore $F$ is monotone decreasing and so the maximum of $F(x)$ on $[0,y]$ occurs when $x = 0$. But

$$F(0) = \varphi(ty) - t\varphi(y) = \frac{-ty\ln(ty) + ty\ln(y)}{\ln(2)} = \frac{-t\ln(t)}{\ln(2)}\,y = \varphi(t)\,y.$$

So for all $0 \le x \le y$ and $0 \le t \le 1$

$$\varphi((1-t)x + ty) - (1-t)\varphi(x) - t\varphi(y) \le \varphi(t)\,y \le \varphi(t)\,y + \varphi(1-t)\,x$$

(for the last step note that $\varphi(1-t)x \ge 0$). A similar argument works in the case $y < x$ (or replace $t$ by $(1-t)$ in what has been shown). □
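The two-sided inequality of Lemma 2.22 can be spot-checked numerically. The Python sketch below is only an illustration under stated assumptions: it uses floating point arithmetic with a small tolerance `tol`, the convention $\varphi(0)=0$, and a uniform grid; the names `phi` and `lemma_2_22_holds` are hypothetical.

```python
import math


def phi(x):
    """phi(x) = x * log2(1/x), extended continuously by phi(0) = 0."""
    return 0.0 if x == 0 else -x * math.log2(x)


def lemma_2_22_holds(t, x, y, tol=1e-12):
    """Check 0 <= phi((1-t)x + ty) - (1-t)phi(x) - t*phi(y) <= phi(t)*y + phi(1-t)*x."""
    middle = phi((1 - t) * x + t * y) - (1 - t) * phi(x) - t * phi(y)
    upper = phi(t) * y + phi(1 - t) * x
    return -tol <= middle <= upper + tol


grid = [i / 20 for i in range(21)]
all_ok = all(lemma_2_22_holds(t, x, y) for t in grid for x in grid for y in grid)
print(all_ok)
```

Taking $t = 1/2$ in the check corresponds to the approximate sub-affinity of $\varphi$, since $\varphi(1/2) = 1/2$.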



Proof of Proposition 2.21. As the function $\varphi(x) = x\log_2(1/x)$ is approximately sub-affine, vanishes at the endpoints of $[0,1]$ and is continuous the lower bound of (2.20) follows from Proposition 2.18. To prove the upper bound we use the series (2.19). The cases $x = 0$ and $x = 1$ are trivial as $H(0) = H(1) = 0$, so let $0 < x < 1$. There exists a unique non-negative integer $m$ so that $2^m x < 1 \le 2^{m+1}x$ (i.e. $1/2^{m+1} \le x < 1/2^m$). Then for $0 \le k \le m$ we have $\{2^k x\} = 2^k x$, and thus

$$H(x) = \sum_{k=0}^{\infty} \frac{\{2^k x\}}{2^k} \le (m+1)x + \sum_{k=m+1}^{\infty} \frac{1}{2^k} = (m+1)x + \frac{1}{2^m}.$$

So to complete the proof it is enough to show

$$\psi(x) := 2x + x\log_2(1/x) - \left((m+1)x + \frac{1}{2^m}\right) = \frac{-x\ln(x)}{\ln(2)} - (m-1)x - \frac{1}{2^m}$$

satisfies $\psi(x) \ge 0$ for $x \in [1/2^{m+1}, 1/2^m]$. But $\psi(1/2^{m+1}) = \psi(1/2^m) = 0$ and $\psi''(x) = -1/(x\ln(2)) < 0$. So $\psi$ is concave on $[1/2^{m+1}, 1/2^m]$ and vanishes at the endpoints which implies $\psi \ge 0$ on the interval. □
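The bounds (2.20) are easy to test on dyadic rationals, where $H$ can be evaluated exactly from the series $H(x)=\sum_{k\ge0}\{2^k x\}/2^k$. The Python sketch below is an illustration only: the helper names `H` and `bounds_2_20_hold`, the grid depth `N`, and the floating point tolerance are assumptions made for this example.

```python
import math
from fractions import Fraction


def H(x):
    """Exact H at a dyadic rational x via the finite sum of frac(2^k x) / 2^k."""
    y = Fraction(x)
    d = y.denominator
    assert d & (d - 1) == 0, "x must be a dyadic rational"
    total = Fraction(0)
    k = 0
    while y.denominator != 1:  # later terms vanish once 2^k x is an integer
        total += (y - y.numerator // y.denominator) / 2**k
        y *= 2
        k += 1
    return total


def bounds_2_20_hold(x, tol=1e-12):
    """Check x*log2(1/x) <= H(x) <= 2x + x*log2(1/x) for a dyadic x in (0, 1]."""
    xf = float(x)
    entropy_term = xf * math.log2(1 / xf)
    return entropy_term - tol <= float(H(x)) <= 2 * xf + entropy_term + tol


N = 10  # test every x = i / 2^N with 0 < x <= 1
all_ok = all(bounds_2_20_hold(Fraction(i, 2**N)) for i in range(1, 2**N + 1))
print(all_ok)
```

At the points $x = 1/2^m$ the lower bound is attained, since both sides equal $m/2^m$ there; this is why a tolerance is used in the comparison.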

2.4. The extremal approximately convex function $E(x)$ on a simplex. Let $e_0, \dots, e_n$ be the standard basis of $\mathbb{R}^{n+1}$. Then the standard simplex is, as usual, $\Delta_n = \operatorname{Co}\{e_0, \dots, e_n\}$. We will often write points of $\Delta_n$ in terms of their affine coordinates $(x_0, \dots, x_n)$ where $x_k \ge 0$ and $\sum_{k=0}^{n} x_k = 1$. This corresponds to $\sum_{k=0}^{n} x_k e_k$. Define a function $E$ on $\Delta_n$ as follows:

$$E\left(\sum_{k=0}^{n} x_k e_k\right) = E(x_0, \dots, x_n) := \sum_{k=0}^{n} H(x_k).$$

2.23. Remark. If $\mu$ is a finite measure and $\mathcal{A}$ is a finite algebra of measurable sets with atoms $A_0, A_1, \dots, A_n$ the entropy of $\mathcal{A}$ is $-\sum_{k=0}^{n} \mu(A_k)\ln\mu(A_k)$. If $x \in \Delta_n$ we can think of $x$ as a measure on $\{0, 1, \dots, n\}$. If $\mathcal{A}$ is the algebra of subsets of $\{0, 1, \dots, n\}$ then its entropy with respect to the measure determined by $x$ is $-\sum_{k} x_k \ln x_k$. By Lemma 2.22 the function $x\log_2(1/x)$ is approximately sub-affine and so $H$ can be viewed as an extremal version of $x\log_2(1/x)$. To the extent that $H(x)$ and $-x\ln(x)$ can be thought of as analogous functions, $E(x) = \sum_{k=0}^{n} H(x_k)$ can be viewed as a "poor man's" version of the entropy. The inequalities (2.20) make this analogy somewhat precise. □

The standard dyadic simplex is

$$\mathcal{D}_n := \left\{\sum_{k=0}^{n} x_k e_k : x_k \in \mathcal{D},\ \sum_{k=0}^{n} x_k = 1\right\}.$$

Like $\mathcal{D} \subset [0,1]$ the set $\mathcal{D}_n$ will play a large role.

2.24. Proposition. The function $E$ is approximately convex and lower semi-continuous on $\Delta_n$ with $E(e_k) = 0$ for $0 \le k \le n$. The points of continuity of $E$ are the points $x = (x_0, \dots, x_n)$ such that all the coordinates $x_k$ are dyadic irrationals. Moreover $E$ satisfies the inequalities

$$\sum_{k=0}^{n} x_k\log_2(1/x_k) \le E(x_0 e_0 + \cdots + x_n e_n) \le 2 + \sum_{k=0}^{n} x_k\log_2(1/x_k).$$

Proof. For $x \in \Delta_n$ the functions $x \mapsto H(x_k)$ are lower semi-continuous by Proposition 2.14. Thus $E$ will also be lower semi-continuous. Also from Proposition 2.14 the points of continuity of $H$ are the dyadic irrationals in $[0,1]$. This implies the statement about the points of continuity of $E$. As $H$ is approximately sub-affine we have

$$E\left(\frac{x+y}{2}\right) = \sum_{k=0}^{n} H\left(\frac{x_k+y_k}{2}\right)
\le \sum_{k=0}^{n} \frac{H(x_k)+H(y_k)}{2} + \sum_{k=0}^{n} \frac{x_k+y_k}{2}
= \frac{E(x)+E(y)}{2} + 1$$

as $\sum_{k=0}^{n} x_k = \sum_{k=0}^{n} y_k = 1$. So $E$ is approximately convex as claimed. That $E(e_k) = 0$ follows from $H(0) = H(1) = 0$. The bounds for $E$ follow from the inequalities (2.20), summed over the coordinates. □
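The approximate convexity of $E$ can be checked exactly on a grid of the dyadic simplex. The Python sketch below does this for $n = 2$; it assumes the series form of $H$ on dyadic rationals, and the helper names `H`, `E` and `simplex_grid`, as well as the grid resolution, are choices made only for this illustration.

```python
from fractions import Fraction
from itertools import product


def H(x):
    """Exact H at a dyadic rational x via the finite sum of frac(2^k x) / 2^k."""
    y = Fraction(x)
    d = y.denominator
    assert d & (d - 1) == 0, "x must be a dyadic rational"
    total = Fraction(0)
    k = 0
    while y.denominator != 1:
        total += (y - y.numerator // y.denominator) / 2**k
        y *= 2
        k += 1
    return total


def E(x):
    """E(x_0, ..., x_n) = H(x_0) + ... + H(x_n)."""
    return sum((H(xk) for xk in x), Fraction(0))


def simplex_grid(n, N):
    """All points of the n-simplex whose coordinates are multiples of 1/N."""
    pts = []
    for idx in product(range(N + 1), repeat=n):
        if sum(idx) <= N:
            last = N - sum(idx)
            pts.append(tuple(Fraction(i, N) for i in idx + (last,)))
    return pts


pts = simplex_grid(2, 8)
midpoint_ok = all(
    E(tuple((a + b) / 2 for a, b in zip(x, y))) <= (E(x) + E(y)) / 2 + 1
    for x in pts
    for y in pts
)
print(midpoint_ok)
```

Because all arithmetic is done with `Fraction`, the comparison involves no rounding.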

It is possible to give an explicit formula for E on the one dimensional 
simplex. 

2.25. Proposition. Let the one dimensional simplex $\Delta_1$ be identified with $[0,1]$ in the usual manner ($t$ corresponds to $(1-t)e_0 + te_1$). Then

$$E(t) = \begin{cases} 2, & t \notin \mathcal{D}; \\[4pt] 2 - \dfrac{1}{2^{l-1}}, & t = \dfrac{m}{2^l} \in \mathcal{D} \text{ with } m \text{ odd}. \end{cases} \tag{2.22}$$

Proof. Set $\psi(t) = \{t\} + \{1-t\} = \{t\} + \{-t\}$. Then by (2.19)

$$E(t) = H(t) + H(1-t) = \sum_{k=0}^{\infty} \frac{\psi(2^k t)}{2^k}. \tag{2.23}$$

But then $\psi(t) = 0$ for $t \in \mathbb{Z}$ and $\psi(t) = 1$ for $t \notin \mathbb{Z}$. So if $t \notin \mathcal{D}$ we have $\psi(2^k t) = 1$ for all $k$. If $t = m/2^l$ with $m$ odd then $\psi(2^k t) = 1$ for $k < l$ and $\psi(2^k t) = 0$ for $k \ge l$. Now the required formula for $E(t)$ follows from the series (2.23). □
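For dyadic $t$ the closed form (2.22) can be compared against a direct evaluation of $H(t) + H(1-t)$. The Python sketch below is a small illustration under the assumption that $H$ is given on dyadic rationals by the series $\sum_k \{2^k x\}/2^k$; the names `H`, `E1` and `formula_2_22` are hypothetical.

```python
from fractions import Fraction


def H(x):
    """Exact H at a dyadic rational x via the finite sum of frac(2^k x) / 2^k."""
    y = Fraction(x)
    d = y.denominator
    assert d & (d - 1) == 0, "x must be a dyadic rational"
    total = Fraction(0)
    k = 0
    while y.denominator != 1:
        total += (y - y.numerator // y.denominator) / 2**k
        y *= 2
        k += 1
    return total


def E1(t):
    """E on the one dimensional simplex: E(t) = H(t) + H(1 - t)."""
    return H(t) + H(1 - t)


def formula_2_22(t):
    """Closed form 2 - 1/2^(l-1) for t = m/2^l with m odd and 0 < t < 1."""
    t = Fraction(t)
    l = t.denominator.bit_length() - 1  # lowest terms: denominator is exactly 2^l
    return 2 - Fraction(1, 2 ** (l - 1))


all_ok = all(E1(Fraction(i, 64)) == formula_2_22(Fraction(i, 64)) for i in range(1, 64))
print(all_ok)
```

The dyadic irrational case $E(t) = 2$ cannot be tested with exact rationals, so only the second branch of (2.22) is exercised here.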

Unfortunately, in higher dimensions $E$ is not as easy to understand. A graph of $E$ on the two dimensional simplex is shown in Figure 2.

Figure 2. Graph of $z = E(x, y, 1 - x - y)$ for $0 \le x \le 1 - y \le 1$.



. . . tum Tartarus ipse
bis patet in praeceps tantum tenditque sub umbras
quantus ad aetherium caeli suspectus Olympum

Respicit Aeneas subito et sub rupe sinistra
moenia lata videt triplici circumdata muro
quae rapidis flammis ambit torrentibus amnis
Tartareus Phlegethon, torquetque sonantia saxa

Then Tartarus itself goes plunging down
In darkness twice as deep as heaven is high
For eyes fixed on etherial Olympus

The Heroe, looking on the left, espy'd
A lofty Tow'r, and strong on ev'ry side
With treble Walls, which Phlegethon surrounds,
Whose fiery flood the burning empire bounds:
And press'd betwixt the Rocks, the bellowing noise resounds.

Vergil, The Aeneid

Translations by Robert Fitzgerald and John Dryden



2.26. Remark. The graph (Figure 2) of $E$ suggests that $E$ has some self similarities. This is indeed the case as we now briefly indicate. For each $k \in \{0, \dots, n\}$ define a map $\delta_k : \Delta_n \to \Delta_n$ by

$$\delta_k(x) = \frac{e_k + x}{2}.$$

This is the dilation by a factor of $1/2$ centered at $e_k$ and it maps $\Delta_n$ onto its subset defined by $1/2 \le x_k \le 1$. The functional equation (2.18) for $H$ can be rewritten in the form $H(t/2) = \{t/2\} + \frac12 H(t)$. We leave it as an exercise for the reader to show this (and $H(t + 1/2) = H(t) + 1/2$ for $0 \le t < 1/2$) can be used in the definition of $E$ to show that for any $x \in \Delta_n$ which is not a vertex

$$E(\delta_k(x)) = 1 + \frac12 E(x).$$

Thus if on the space $\Delta_n \times [0, \infty)$ a map $\theta_k$ is defined by $\theta_k(x, z) = (\delta_k(x), 1 + z/2)$ then the graph of $E$ (with the points over the vertices deleted) is invariant under $\theta_k$. Each $\theta_k$ is the dilation by a factor of $1/2$ with center $(e_k, 2)$. This explains the self similarities of the graph of $E$. □
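The self-similarity relation $E(\delta_k(x)) = 1 + \frac12 E(x)$ can be verified exactly at dyadic points of $\Delta_2$ that are not vertices. The following Python sketch is illustrative only; it assumes the series form of $H$ on dyadic rationals, and `H`, `E`, `delta` and the grid size are hypothetical names and choices.

```python
from fractions import Fraction


def H(x):
    """Exact H at a dyadic rational x via the finite sum of frac(2^k x) / 2^k."""
    y = Fraction(x)
    d = y.denominator
    assert d & (d - 1) == 0, "x must be a dyadic rational"
    total = Fraction(0)
    k = 0
    while y.denominator != 1:
        total += (y - y.numerator // y.denominator) / 2**k
        y *= 2
        k += 1
    return total


def E(x):
    """E(x_0, ..., x_n) = H(x_0) + ... + H(x_n)."""
    return sum((H(xk) for xk in x), Fraction(0))


def delta(k, x):
    """Dilation by 1/2 centered at the vertex e_k: x -> (e_k + x) / 2."""
    return tuple((xj + (1 if j == k else 0)) / 2 for j, xj in enumerate(x))


N = 16
points = [
    (Fraction(i, N), Fraction(j, N), Fraction(N - i - j, N))
    for i in range(N + 1)
    for j in range(N + 1 - i)
]
non_vertices = [x for x in points if max(x) < 1]
all_ok = all(E(delta(k, x)) == 1 + E(x) / 2 for x in non_vertices for k in range(3))
print(all_ok)
```

Vertices are excluded, as in the remark: for example $\delta_0(e_0) = e_0$ and $E(e_0) = 0 \ne 1$.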



Our next result implies that the upper bound of Theorem 2.3 is sharp. Recall that a subset of a metric space is a $G_\delta$ iff it is a countable intersection of open sets.

2.27. Theorem. The function $E$ achieves its maximum value of $\kappa(n)$ on an uncountable $G_\delta$ subset of $\Delta_n$.

2.28. Remark. The maximum of $E$ does not occur at the center $(1/(n+1), \dots, 1/(n+1))$ of $\Delta_n$. Given the symmetry of the problem this is a little surprising. □



Proof. That $\sup E[\Delta_n] \le \kappa(n)$ follows from Theorem 2.3. To show the maximum is attained, let $m = [\log_2(n)]$ so that $n = 2^m + r$ with $0 \le r < 2^m$. Suppose $x = (x_0, \dots, x_n) \in \Delta_n$ with each coordinate $x_k$ a dyadic irrational. In particular, if $x_k = \sum_{j=0}^{\infty} x_{k,j}/2^j$ then $x_{k,j}$ is zero for infinitely many $j$ and one for infinitely many $j$. Let $M_j(x) := \#\{k : x_{k,j} = 1\}$. We claim that $E(x) = \kappa(n)$ provided each coordinate $x_k$ is a dyadic irrational and

$$M_j(x) = \begin{cases} 0, & j \le m; \\ n - 2r, & j = m+1; \\ n, & j \ge m+2. \end{cases} \tag{2.24}$$

Let $K$ be the set of all $x = (x_0, \dots, x_n)$ that satisfy these two conditions. If $x \in K$, then

$$\sum_{k=0}^{n} x_k = \sum_{j=0}^{\infty} \sum_{k=0}^{n} \frac{x_{k,j}}{2^j} = \sum_{j=0}^{\infty} \frac{M_j(x)}{2^j}
= \frac{M_{m+1}(x)}{2^{m+1}} + n\sum_{j=m+2}^{\infty} \frac{1}{2^j}
= \frac{n-2r}{2^{m+1}} + \frac{n}{2^{m+1}} = 1,$$

where the last equality uses $2n - 2r = 2^{m+1}$. Thus $x \in \Delta_n$ and so $K \subset \Delta_n$.

To see that $K$ is uncountable (and thus nonempty) let $(a_{m+2}, a_{m+3}, \dots)$ be a sequence in $\{0, 1, \dots, n\}$ such that for every $k \in \{0, 1, \dots, n\}$, $a_j = k$ for infinitely many $j$. We let $x_{k,j} = 0$ if $j \le m$ and we let $x_{k,m+1} = 1$ for exactly $n - 2r$ many $k$. For $j \ge m+2$, let

$$x_{k,j} = \begin{cases} 1, & a_j \ne k; \\ 0, & a_j = k. \end{cases}$$

Since each sequence $(x_{k,j})_{j=0}^{\infty}$ has infinitely many zeros and ones, each $x_k$ is a dyadic irrational. Thus $x = (x_0, \dots, x_n) \in K$. As there are uncountably many such sequences $(a_{m+2}, a_{m+3}, \dots)$ the set $K$ is uncountable.

If $x = (x_0, \dots, x_n) \in K$ then, using the definition (2.15) of $H$ and the identity $\sum_{j=m+2}^{\infty} j/2^j = (m+3)/2^{m+1}$, we have

$$E(x) = \sum_{k=0}^{n} H(x_k) = \sum_{j=0}^{\infty} \frac{j\,M_j(x)}{2^j}
= \frac{M_{m+1}(x)\,(m+1)}{2^{m+1}} + n\sum_{j=m+2}^{\infty} \frac{j}{2^j}$$

$$= \frac{(n-2r)(m+1)}{2^{m+1}} + \frac{n(m+3)}{2^{m+1}} = \frac{(2n-2r)(m+1) + 2n}{2^{m+1}}
= m + 1 + \frac{n}{2^m} = \kappa(n).$$

This shows that $E$ achieves its maximum at all points of $K$. Finally $\{x \in \Delta_n : E(x) = \kappa(n)\} = \bigcap_{l=1}^{\infty} E^{-1}[(\kappa(n) - 1/l, \infty)]$ and each of the sets $E^{-1}[(\kappa(n) - 1/l, \infty)]$ is open as $E$ is lower semi-continuous. Thus $\{x \in \Delta_n : E(x) = \kappa(n)\}$ is a $G_\delta$. □
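The arithmetic in this proof can be replayed with exact rationals. The Python sketch below, offered only as an illustration, encodes $\kappa(n) = m + 1 + n/2^m$ and the digit counts $M_j$ of (2.24), and forms the partial sums of $\sum_j M_j/2^j$ and $\sum_j jM_j/2^j$ up to a cutoff $J$. The names `kappa`, `M` and `partial_sums` are hypothetical, and the tail formulas in the comments follow from the geometric identities used in the proof.

```python
from fractions import Fraction


def kappa(n):
    """kappa(n) = m + 1 + n / 2^m with m = floor(log2(n)), for n >= 1."""
    m = n.bit_length() - 1
    return m + 1 + Fraction(n, 2**m)


def M(j, n):
    """Digit counts M_j from (2.24) for a point of the set K."""
    m = n.bit_length() - 1
    r = n - 2**m
    if j <= m:
        return 0
    if j == m + 1:
        return n - 2 * r
    return n


def partial_sums(n, J):
    """Partial sums over j <= J of M_j / 2^j and of j * M_j / 2^j."""
    coords = sum((Fraction(M(j, n), 2**j) for j in range(J + 1)), Fraction(0))
    value = sum((Fraction(j * M(j, n), 2**j) for j in range(J + 1)), Fraction(0))
    return coords, value


# For J >= m + 1 the tails are n / 2^J and n * (J + 2) / 2^J respectively,
# so the partial sums converge to 1 and to kappa(n).
for n in (1, 2, 3, 5, 8):
    coords, value = partial_sums(n, 40)
    print(n, kappa(n), float(1 - coords), float(kappa(n) - value))
```

The printed gaps shrink geometrically in $J$, consistent with $\sum_k x_k = 1$ and $E(x) = \kappa(n)$ for $x \in K$.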

2.29. Remark. With a little more work it can be shown that $E(x) = \kappa(n)$ if and only if $x \in K$ with $K$ as above. □

2.30. Theorem. The function $E$ is the largest bounded approximately convex function on $\Delta_n$ that vanishes on the vertices. More precisely, if $h$ is any bounded approximately convex function on $\Delta_n$ with $h(e_k) \le 0$ for $k = 0, 1, \dots, n$, then $h \le E$ on $\Delta_n$.

2.31. Corollary. Let $h : \Delta_n \to \mathbb{R}$ be an approximately convex function that is Borel measurable. Then for any $x = \sum_{k=0}^{n} x_k e_k$ the inequality

$$h(x) \le \kappa(n) + \sum_{k=0}^{n} x_k h(e_k)$$

holds. In particular, if $h(e_k) \le 0$ for all $k$, then $h \le \kappa(n)$.



Proof of Theorem 2.31. Define $l$ on $\Delta_n$ by $l(x) = \sum_{k=0}^{n} x_k h(e_k)$. Then the function $h(x) - l(x)$ is approximately convex, Borel measurable, and vanishes on the vertices of $\Delta_n$. So by replacing $h$ by $h - l$ we may assume $h$ vanishes on the vertices of $\Delta_n$, and it will be enough to show $h \le \kappa(n)$ on $\Delta_n$. We do this by induction on $n$. For $n = 1$ it follows from results of Ng and Nikodem (Cor. 1 and Thm 2 of their paper) that $h$ is bounded above. But then $h \le \kappa(1) = 2$ by Theorem 2.3. Now let $n \ge 2$ and assume the result holds for all simplices with dimension $< n$. Consider $\Delta_{n-1}$ as a face of $\Delta_n$ in the natural way ($\Delta_{n-1} = \operatorname{Co}\{e_0, \dots, e_{n-1}\} \subset \operatorname{Co}\{e_0, \dots, e_n\}$). Then by the induction hypothesis $h|_{\Delta_{n-1}} \le \kappa(n-1)$. Now any point $x \in \Delta_n$ has a representation as $x = (1-t)e_n + ty$ where $y \in \Delta_{n-1}$ and $t \in [0,1]$. But then the one dimensional result (applied to the restriction of $h$ to the segment between $e_n$ and $y$, where we note that this restriction is Borel and thus Lebesgue measurable) implies

$$h(x) = h((1-t)e_n + ty) \le 2 + (1-t)h(e_n) + t\,h(y) \le 2 + 0 + t\,\kappa(n-1) \le 2 + \kappa(n-1).$$






Thus $h$ is bounded above on $\Delta_n$. But then we can use Theorem 2.3 and reduce the bound to $\kappa(n)$. This completes the proof. □

2.32. Corollary. Let $U \subset \mathbb{R}^n$ be a convex set and let $h : U \to \mathbb{R}$ be an approximately convex function that is either Borel measurable or bounded above on compact subsets of $U$. Then for any $m \le n$, points $x_0, \dots, x_m \in U$ and $(\alpha_0, \dots, \alpha_m) \in \Delta_m$, we have

$$h(\alpha_0 x_0 + \cdots + \alpha_m x_m) \le E(\alpha_0, \dots, \alpha_m) + \alpha_0 h(x_0) + \cdots + \alpha_m h(x_m)
\le \kappa(m) + \alpha_0 h(x_0) + \cdots + \alpha_m h(x_m).$$

Proof. Define $f : \Delta_m \to \mathbb{R}$ by

$$f(\alpha_0, \dots, \alpha_m) := h(\alpha_0 x_0 + \cdots + \alpha_m x_m) - \bigl(\alpha_0 h(x_0) + \cdots + \alpha_m h(x_m)\bigr).$$

Then $f$ is approximately convex and bounded above on $\Delta_m$ or is Borel measurable on $\Delta_m$. As $f$ vanishes on the vertices of $\Delta_m$ either Theorem 2.30 or Corollary 2.31 implies $f \le E \le \kappa(m)$ on $\Delta_m$. This is equivalent to the conclusion of the corollary. □
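We start the proof of Theorem 2.30 by extending the idea of the dyadic support from $\mathcal{D}$ to $\mathcal{D}_n$. If $x = \sum_{k=0}^{n} \bigl(\sum_{j=0}^{\infty} x(j,k)/2^j\bigr) e_k \in \mathcal{D}_n$ (here $x(j,k) \in \{0,1\}$) then set

$$\operatorname{supp} x := \{(j,k) : x(j,k) = 1\}. \tag{2.25}$$

The following is trivial to prove using Lemma 2.19 and the definition of $E$ in terms of $H$.

2.33. Lemma. If $x, y \in \mathcal{D}_n$ and $(\operatorname{supp} x) \cap (\operatorname{supp} y) = \emptyset$ then

$$E\left(\frac{x+y}{2}\right) = \frac{E(x)+E(y)}{2} + 1.$$ □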

2.34. Lemma. If $x \in \mathcal{D}_n$ and $x \notin \{e_0, \dots, e_n\}$, then there are $y, z \in \mathcal{D}_n$ so that $x = (y+z)/2$ and $(\operatorname{supp} y) \cap (\operatorname{supp} z) = \emptyset$.

Proof. Letting $x = \sum_{k=0}^{n} \bigl(\sum_{j=0}^{N} x(j,k)/2^j\bigr) e_k$ it suffices to show that there are nonempty sets $A, B$ so that $A \cap B = \emptyset$ and

$$\sum_{(j,k)\in A} \frac{x(j,k)}{2^{j-1}} = 1 = \sum_{(j,k)\in B} \frac{x(j,k)}{2^{j-1}}.$$

For then if $a = \sum_{(j,k)\in A} \bigl(x(j,k)/2^{j-1}\bigr) e_k$ and $b = \sum_{(j,k)\in B} \bigl(x(j,k)/2^{j-1}\bigr) e_k$ we have $a, b \in \mathcal{D}_n$, $(\operatorname{supp} a) \cap (\operatorname{supp} b) = \emptyset$ and $x = (a+b)/2$.

We first prove by induction on $\sum_{j=1}^{N} a_j$ that if $a_1, \dots, a_N$ are positive integers so that $\sum_{j=1}^{N} a_j/2^j = 1$ then there are $b_j, c_j \in \mathbb{N}$ such that $a_j = b_j + c_j$ and $\sum_{j=1}^{N} b_j/2^j = \sum_{j=1}^{N} c_j/2^j = 1/2$. Note that $a_N$ is even (otherwise $2^N \sum_{j=1}^{N} 2^{-j} a_j = \sum_{j=1}^{N} 2^{N-j} a_j$ would be odd and so could not equal $2^N$) and so $a_N - 2 \ge 0$. Therefore

$$\sum_{j=1}^{N-2} \frac{a_j}{2^j} + \frac{a_{N-1}+1}{2^{N-1}} + \frac{a_N-2}{2^N} = 1.$$

Since $\sum_{j=1}^{N-2} a_j + (a_{N-1}+1) + (a_N-2) = \sum_{j=1}^{N} a_j - 1$ we may apply the induction hypothesis, which yields the claim.

Now let $x \in \mathcal{D}_n$ be as above. Let $a_j := \#\{k : x(j,k) = 1\}$. Then $\sum_{j=1}^{N} a_j/2^j = 1$. Therefore we have $a_j = b_j + c_j$ as above. Then splitting each of the sets $\{k : x(j,k) = 1\}$ into two disjoint sets $A_j$ and $B_j$ with $\#(A_j) = b_j$ and $\#(B_j) = c_j$ we let $A := \bigcup_{j=1}^{N} A_j$ and $B := \bigcup_{j=1}^{N} B_j$. This completes the proof. □
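The combinatorial step in this proof, splitting counts $a_j$ with $\sum_j a_j/2^j = 1$ into $a_j = b_j + c_j$ with $\sum_j b_j/2^j = \sum_j c_j/2^j = 1/2$, can be carried out mechanically. The Python sketch below uses a greedy pass over $j = 1, 2, \dots$ rather than the induction of the proof; this is a swapped-in technique, chosen because the remaining capacity at stage $j$ is always a multiple of $2^{-j}$, so the greedy fill reaches exactly $1/2$. The function name `split_half` is hypothetical, and zero counts are allowed.

```python
from fractions import Fraction


def split_half(a):
    """Split counts a = [a_1, ..., a_N] with sum a_j / 2^j = 1 into (b, c).

    Returns lists b, c of nonnegative integers with a_j = b_j + c_j and
    sum b_j / 2^j = sum c_j / 2^j = 1/2.  Greedy variant, not the induction
    used in the proof of Lemma 2.34.
    """
    total = sum((Fraction(aj, 2**j) for j, aj in enumerate(a, start=1)), Fraction(0))
    assert total == 1, "the weighted counts must sum to 1"
    remaining = Fraction(1, 2)  # capacity still to be filled by b
    b = []
    for j, aj in enumerate(a, start=1):
        # remaining is a multiple of 2^-j here, so this product is an integer
        take = min(aj, int(remaining * 2**j))
        b.append(take)
        remaining -= Fraction(take, 2**j)
    assert remaining == 0
    c = [aj - bj for aj, bj in zip(a, b)]
    return b, c


b, c = split_half([0, 2, 3, 2])  # 2/4 + 3/8 + 2/16 = 1
print(b, c)
```

Given such a split, the sets $A_j$ and $B_j$ of the proof are obtained by assigning $b_j$ of the indices $k$ with $x(j,k) = 1$ to $A_j$ and the remaining $c_j$ to $B_j$.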

2.35. Proposition. Let $h$ be any approximately convex function on $\Delta_n$ (not necessarily bounded above) such that $h(e_k) \le 0$ for $0 \le k \le n$. Then $h(x) \le E(x)$ for all $x \in \mathcal{D}_n$.

Proof. The proof is by induction on $m = \#(\operatorname{supp} x)$. If $m = 1$ then $x = e_k$ for some $k$ and $h(e_k) \le 0 = E(e_k)$. Now assume that $h(x) \le E(x)$ for all $x$ with $\#(\operatorname{supp} x) \le m - 1$ and let $\#(\operatorname{supp} x) = m$. By Lemma 2.34 we can write $x = (y+z)/2$ with $\#(\operatorname{supp} y), \#(\operatorname{supp} z) \le m - 1$. Using the induction hypothesis and Lemma 2.33

$$h(x) = h\left(\frac{y+z}{2}\right) \le \frac{h(y)+h(z)}{2} + 1 \le \frac{E(y)+E(z)}{2} + 1 = E(x).$$ □



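The following lets us pass from knowing inequalities for $E$ on $\mathcal{D}_n$ to proving them on $\Delta_n$.

2.36. Lemma. If $x \in \Delta_n$ then there is a sequence $(x(r))_{r=1}^{\infty}$ from $\mathcal{D}_n$ so that $\lim_{r\to\infty} x(r) = x$ and $\lim_{r\to\infty} E(x(r)) = E(x)$.

Proof. Write $x = \sum_{k=0}^{n} x_k e_k$. By reordering we can assume for some $\ell \in \{0, \dots, n\}$ that $x_k \in \mathcal{D}$ for $0 \le k \le \ell$ and $x_k \notin \mathcal{D}$ for $\ell + 1 \le k \le n$. For $0 \le k \le \ell$ set $x_k(r) = x_k$ for all $r$. As $\sum_{k=0}^{n} x_k = 1$ and $\sum_{k=0}^{\ell} x_k \in \mathcal{D}$ (as $x_k \in \mathcal{D}$ for each $x_k$ in this sum) the sum $\delta := \sum_{k=\ell+1}^{n} x_k = 1 - \sum_{k=0}^{\ell} x_k$ will also be a dyadic rational. Let $\Delta_{n-\ell-1}(\delta) = \{\sum_{k=\ell+1}^{n} a_k e_k : a_k \ge 0,\ \sum_{k=\ell+1}^{n} a_k = \delta\}$ and $\mathcal{D}_{n-\ell-1}(\delta) = \{\sum_{k=\ell+1}^{n} a_k e_k : a_k \in \mathcal{D},\ \sum_{k=\ell+1}^{n} a_k = \delta\}$. Then $\mathcal{D}_{n-\ell-1}(\delta)$ will be dense in $\Delta_{n-\ell-1}(\delta)$ so there is a sequence $y(r) = \sum_{k=\ell+1}^{n} y_k(r) e_k$ in $\mathcal{D}_{n-\ell-1}(\delta)$ with $\lim_{r\to\infty} y(r) = y := \sum_{k=\ell+1}^{n} x_k e_k$. Set $x_k(r) = y_k(r)$ for $\ell + 1 \le k \le n$. Then $x_k(r) = x_k \in \mathcal{D}$ for $0 \le k \le \ell$ and $\lim_{r\to\infty} x_k(r) = x_k \notin \mathcal{D}$ for $\ell + 1 \le k \le n$. Set $x(r) = \sum_{k=0}^{n} x_k(r) e_k$. Then $x(r) \in \mathcal{D}_n$ and $\lim_{r\to\infty} x(r) = x$. We now use the definition of $E$ in terms of $H$ and the fact that $H$ is continuous at all dyadic irrationals (Proposition 2.14) to obtain

$$\lim_{r\to\infty} E(x(r)) = \sum_{k=0}^{\ell} H(x_k) + \lim_{r\to\infty} \sum_{k=\ell+1}^{n} H(x_k(r)) = \sum_{k=0}^{n} H(x_k) = E(x).$$ □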



Proof of Theorem 2.30. Let $E_{\Delta_n,0}$ be the extremal approximately convex function on $\Delta_n$ that takes the value $0$ on the vertices (cf. its definition earlier in this section). We wish to show $E = E_{\Delta_n,0}$. The inequality $E \le E_{\Delta_n,0}$ follows from the definition of $E_{\Delta_n,0}$, so it is enough to prove $E_{\Delta_n,0} \le E$. Let $x \in \Delta_n$. By Lemma 2.36 there is a sequence $x(r) \in \mathcal{D}_n$ such that $\lim_{r\to\infty} x(r) = x$ and $\lim_{r\to\infty} E(x(r)) = E(x)$. By Proposition 2.35 $E_{\Delta_n,0}(x(r)) \le E(x(r))$. By Theorem 2.12 the function $E_{\Delta_n,0}$ is lower semi-continuous. Therefore

$$E_{\Delta_n,0}(x) \le \liminf_{r\to\infty} E_{\Delta_n,0}(x(r)) \le \lim_{r\to\infty} E(x(r)) = E(x).$$ □



2.5. Extremal approximately convex functions on convex polytopes. Let $K \subset \mathbb{R}^n$ be a compact convex set with extreme points $V$ and let $\varphi : V \to \mathbb{R}$ be bounded. Earlier in Section 2 we defined the extremal approximately convex function $E_{K,\varphi}$ with extreme values $\varphi$ but without being explicit about how to compute it. In §2.4 we gave a very explicit description of $E = E_{\Delta_n,0}$, the extremal approximately convex function on the simplex. Here we show that when $K$ is a polytope (that is, the convex hull of a finite number of points) then $E_{K,\varphi}$ can be expressed directly in terms of $E_{\Delta_m,0}$ for some $m$. We first establish some elementary properties of approximately convex functions under affine maps.

2.37. Proposition. Let $A \subset \mathbb{R}^m$ and $B \subset \mathbb{R}^n$ be convex sets and $T : \mathbb{R}^m \to \mathbb{R}^n$ an affine map.

1. If $T[A] \subseteq B$ and $f$ is an approximately convex function on $B$ then $T^* f(x) := f(T(x))$ is an approximately convex function on $A$.

2. If $T[A] \supseteq B$ and $h$ is an approximately convex function on $A$ which is bounded from below then $T_* h(y) := \inf_{T(x)=y} h(x)$ is approximately convex on $B$.

3. Both $T^*$ and $T_*$ are order preserving. That is $f_1 \le f_2$ and $h_1 \le h_2$ pointwise implies $T^* f_1 \le T^* f_2$ and $T_* h_1 \le T_* h_2$ pointwise.

4. If $T[A] = B$, $h$ is approximately convex and bounded below on $A$ and $f$ is approximately convex and bounded below on $B$, then $T^* T_* h \le h$ and $T_* T^* f = f$.

Proof. This is just a chase through the definitions of $T^*$ and $T_*$. □

Let $K$ be a convex polytope in $\mathbb{R}^n$ with extreme points $V = \{v_0, \dots, v_m\}$ and extreme values given by $\varphi : V \to \mathbb{R}$. We wish to find the largest approximately convex function $F$ on $K$ so that $F(v_k) = \varphi(v_k)$ for $0 \le k \le m$. Toward this end let $E = E_{\Delta_m,0}$ be the extremal approximately convex function on the simplex $\Delta_m$ and define $E_{\Delta_m,\varphi}$ on $\Delta_m$ by

$$E_{\Delta_m,\varphi}(x) = E_{\Delta_m,\varphi}(x_0, \dots, x_m) := E_{\Delta_m,0}(x_0, \dots, x_m) + \sum_{k=0}^{m} x_k \varphi(v_k).$$

(This is a slight misuse of notation as $\varphi$ is a function on the extreme points $V$ of $K$ rather than the set of extreme points $\{e_0, \dots, e_m\}$ of $\Delta_m$.) Then, as $x \mapsto \sum_{k=0}^{m} x_k \varphi(v_k)$ is affine, the function $E_{\Delta_m,\varphi}$ is approximately convex on $\Delta_m$ and satisfies $E_{\Delta_m,\varphi}(e_k) = \varphi(v_k)$. Moreover $E_{\Delta_m,\varphi}$ is the extremal approximately convex function on $\Delta_m$ taking on these values on the vertices in the sense that if $f : \Delta_m \to \mathbb{R}$ is approximately convex and bounded above, lower semi-continuous, and $f(e_k) \le \varphi(v_k)$ then $f(x) \le E_{\Delta_m,\varphi}(x)$ for all $x \in \Delta_m$.

Returning to our extremal problem there is a unique affine map $T : \Delta_m \to K$ such that $T(e_k) = v_k$ for $0 \le k \le m$. Then $T[\Delta_m] = K$. Define $F_{K,\varphi} : K \to \mathbb{R}$ by

$$F_{K,\varphi} := T_* E_{\Delta_m,\varphi}.$$

Then another definition chase shows $F_{K,\varphi}(v_k) = \varphi(v_k)$.

2.38. Theorem. Using the notation above, the extremal approximately convex function on the polytope $K$ with extreme values $\varphi$ is

$$E_{K,\varphi} = T_* E_{\Delta_m,\varphi}.$$

The function $E_{K,\varphi}$ is lower semi-continuous.

Proof. Let $f : K \to \mathbb{R}$ be approximately convex, bounded, and satisfy $f(v_k) \le \varphi(v_k)$. Then the function $T^* f$ on $\Delta_m$ is approximately convex, bounded, and $T^* f(e_k) = f(v_k) \le \varphi(v_k)$. Therefore $T^* f \le E_{\Delta_m,\varphi}$. But then $f = T_* T^* f \le T_* E_{\Delta_m,\varphi}$ which proves $T_* E_{\Delta_m,\varphi} = E_{K,\varphi}$. The lower semi-continuity of $E_{K,\varphi}$ follows from Theorem 2.12. □



2.6. A stability theorem of Hyers-Ulam type. Here we give a stability result for approximately convex functions related to and motivated by a theorem of Hyers and Ulam. The idea is that an approximately convex function is close (in the uniform norm) to some convex function.

2.39. Theorem. Assume that $U \subseteq \mathbb{R}^n$ is convex, $\varepsilon > 0$, and that $f : U \to \mathbb{R}$ is bounded above on compact sets and satisfies

$$f\left(\frac{x+y}{2}\right) \le \frac{f(x)+f(y)}{2} + \varepsilon. \tag{2.26}$$

Then there exist convex functions $g, g_0 : U \to \mathbb{R}$ such that

$$g(x) \le f(x) \le g(x) + \kappa(n)\varepsilon \quad\text{and}\quad |f(x) - g_0(x)| \le \frac{\kappa(n)}{2}\varepsilon \tag{2.27}$$

for all $x \in U$. The constant $\kappa(n)$ is the best possible constant in these inequalities.

Proof. By replacing $f$ by $\varepsilon^{-1} f$ we may assume $\varepsilon = 1$ so that $f$ is approximately convex. Following Hyers and Ulam (p. 823 of their paper) or Cholewa (pp. 81-82 of his paper) set $W := \{(x,y) \in \mathbb{R}^n \times \mathbb{R} : x \in U,\ y \ge f(x)\}$ and define $g$ by

$$g(x) := \inf\{y : (x,y) \in \operatorname{Co}(W)\}.$$

We now show that $g$ does not take on the value $-\infty$. If $(x,y) \in \operatorname{Co}(W)$ then by Carathéodory's Theorem there exist $n+2$ points $(x_0,y_0), \dots, (x_{n+1},y_{n+1}) \in W$ and $(\alpha_0, \dots, \alpha_{n+1}) \in \Delta_{n+1}$ such that $(x,y) = \sum_{k=0}^{n+1} \alpha_k (x_k, y_k)$. Therefore






by Corollary 2.32

$$f(x) = f\left(\sum_{k=0}^{n+1} \alpha_k x_k\right) \le \kappa(n+1) + \sum_{k=0}^{n+1} \alpha_k f(x_k)
\le \kappa(n+1) + \sum_{k=0}^{n+1} \alpha_k y_k = \kappa(n+1) + y.$$

Thus $y \ge f(x) - \kappa(n+1)$ which implies $g(x) \ge f(x) - \kappa(n+1) > -\infty$.

From the definition it is clear that $g(x) \le f(x)$ and that $g$ is convex. To see that $f(x) \le g(x) + \kappa(n)$ let $\delta > 0$ and choose $y$ so that $(x,y) \in \operatorname{Co}(W)$ and $y < g(x) + \delta$. Then as above there are $n+2$ points $(x_0,y_0), \dots, (x_{n+1},y_{n+1}) \in W$ such that $(x,y) \in \Delta := \operatorname{Co}(\{(x_0,y_0), \dots, (x_{n+1},y_{n+1})\})$. Let $\bar y := \min\{\eta : (x,\eta) \in \Delta\}$. Then $(x,\bar y)$ is on the boundary of $\Delta$ and so it is a convex combination of $n+1$ of the points $(x_0,y_0), \dots, (x_{n+1},y_{n+1})$, say $(x,\bar y) = \sum_{k=0}^{n} \alpha_k (x_k, y_k)$ with $(\alpha_0, \dots, \alpha_n) \in \Delta_n$. Then a calculation like the one showing that $g(x) > -\infty$ (but with $n+1$ replacing $n+2$) yields that $f(x) \le \bar y + \kappa(n) \le g(x) + \delta + \kappa(n)$. As $\delta > 0$ was arbitrary this implies $f(x) \le g(x) + \kappa(n)$.

Letting $g_0(x) = g(x) + \kappa(n)/2$ we have $|f(x) - g_0(x)| \le \kappa(n)/2$.

Finally to see that the constants in question are sharp consider the approximately convex function $E : \Delta_n \to \mathbb{R}$ which has $\max E = \kappa(n)$. Then the largest convex function $g$ on $\Delta_n$ with $g \le E$ is $g(x) = 0$. Likewise $g_0(x) = \kappa(n)/2$ has $|E(x) - g_0(x)| \le \kappa(n)/2$ and no other convex function on $\Delta_n$ gives a better estimate. □



2.7. Examples of approximately convex functions. Here we give examples showing that the hypotheses of our results are necessary.

2.40. Example. Let $f(t)$ be any approximately sub-affine function on $[0,1]$. Then (as in the proof of Proposition 2.24) the function $F(x) := f(x_0) + f(x_1) + \cdots + f(x_n)$ defined on the simplex $\Delta_n$ will be approximately convex. Using the function $f(t) = t\log_2(1/t)$ shows that for example $F(x) := \sum_{k=0}^{n} x_k\log_2(1/x_k)$ is approximately convex (cf. Lemma 2.22). As a slight generalization of this, if $f_0, \dots, f_n$ are all approximately sub-affine then $F_1(x) := f_0(x_0) + f_1(x_1) + \cdots + f_n(x_n)$ is approximately convex. □

2.41. Example. Let $C$ be any convex subset of any normed vector space and let $\varphi : C \to [0,1]$. Then $\varphi((x+y)/2) \le 1 \le (\varphi(x) + \varphi(y))/2 + 1$ so $\varphi$ is approximately convex. There is no assumption on $\varphi$ other than the bounds $0 \le \varphi \le 1$. Thus $\varphi$ need not be continuous or measurable. So approximate convexity by itself does not imply any type of regularity of the function. □



2.42. Example. View $\mathbb{R}^{n+1}$ as a vector space over the rational numbers $\mathbb{Q}$ and let $B$ be a Hamel basis for $\mathbb{R}^{n+1}$ over $\mathbb{Q}$. Let $h : \mathbb{R}^{n+1} \to \mathbb{R}$ be obtained by first mapping $B$ to $\mathbb{R}$ and then extending to $\mathbb{R}^{n+1}$ by linearity. We can choose $B \subset \Delta_n = \operatorname{Co}\{e_0, \dots, e_n\}$ (with $e_0, \dots, e_n$ the standard basis of $\mathbb{R}^{n+1}$) and $h$ so that $h[B]$ is dense in $\mathbb{R}$. Therefore $h$ is unbounded on $\Delta_n$.

To get an example more closely related to Theorem 2.3 let $h$ be as just defined but chosen in such a way that $h(e_i) = 0$ for $0 \le i \le n$ and set $h_0(x) := \max\{h(x), 0\}$. Then for $A := \{e_0, \dots, e_n\}$ we have $\Delta_n = \operatorname{Co}(A)$, $h_0$ is bounded from below, and $h_0 = 0$ on $A$. But $h_0$ is not bounded from above on $\Delta_n$. This shows the assumption that $h$ be bounded from above in Theorem 2.3 is necessary. A similar example appears in §3 of the paper of Cholewa. □

2.43. Example. As an extension of the last example let $\Delta^k_{n-1}$ for $0 \le k \le n$ be the ($(n-1)$-dimensional) faces of $\Delta_n$. For each $k$ choose an unbounded approximately convex function $h_k : \Delta^k_{n-1} \to [0, \infty)$ that vanishes on the vertices of $\Delta^k_{n-1}$ (possible by the last example). Let $h : \Delta_n \to [0, \infty)$ be $h(x) = 0$ on the interior of $\Delta_n$ and for each face $h|_{\Delta^k_{n-1}} = h_k$. (A little care must be taken in the choice of the $h_k$'s to ensure that these restrictions agree on the intersections of the faces. This is not hard to arrange and we leave the details to the reader.) Then as the boundary of $\Delta_n$ (which is $\bigcup_{k=0}^{n} \Delta^k_{n-1}$) is a set of measure zero the function $h$ is Lebesgue measurable on $\Delta_n$, but is not Borel measurable. This shows that the hypothesis of Theorem 2.31 cannot be weakened from Borel measurable to Lebesgue measurable. □

3. The Size of the Convex Hull of an Approximately Convex Set

In this section we apply our results on approximately convex functions 
to the problem of giving a priori bounds on the size of convex hull of an 
approximately convex set. 

3.1. General upper bounds. We now apply our results to the geometric 
problem of computing the size of the convex hull. 

3.1. Theorem. Let $\|\cdot\|$ be any norm on $\mathbb{R}^n$ and let $A \subset \mathbb{R}^n$ be a set that is approximately convex in this norm. Let $b \in \operatorname{Co}(A)$ so that for some $a_0, \dots, a_m \in A$ with $m \le n$ we have $b = \sum_{k=0}^{m} \alpha_k a_k$ where $(\alpha_0, \dots, \alpha_m) \in \Delta_m$. Then

$$\operatorname{dist}(b, A) \le E(\alpha_0, \dots, \alpha_m) \le \kappa(m) \le \kappa(n). \tag{3.1}$$

(In the terminology of Theorem 1 this implies that $C_{\|\cdot\|} \le \kappa(n)$.)

3.2. Remark. For bounded sets this result can be restated in a dilation invariant fashion that does not involve approximately convex sets in its statement: If $A \subset \mathbb{R}^n$ is a bounded set and $b \in \operatorname{Co}(A)$ so that $b = \sum_{k=0}^{m} \alpha_k a_k$ as in the statement of the theorem, then

$$\operatorname{dist}(b, A) \le E(\alpha_0, \dots, \alpha_m)\, d_H\bigl(\tfrac12(A+A), A\bigr) \le \kappa(m)\, d_H\bigl(\tfrac12(A+A), A\bigr).$$

The results below have similar dilation invariant versions. □
Proof. Define a function $f : \Delta_m \to [0, \infty)$ by

$$f(\beta_0, \dots, \beta_m) := \operatorname{dist}\left(\sum_{k=0}^{m} \beta_k a_k,\ A\right).$$

Then as the function $x \mapsto \operatorname{dist}(x, A)$ on $\mathbb{R}^n$ is an approximately convex function and the map $(\beta_0, \dots, \beta_m) \mapsto \sum_{k=0}^{m} \beta_k a_k$ is affine the function $f$ is approximately convex and it is clearly continuous. Also $f$ vanishes on the vertices of $\Delta_m$. Therefore by Theorem 2.30 the bound $f(\beta_0, \dots, \beta_m) \le E(\beta_0, \dots, \beta_m)$ holds. But this implies (3.1). □
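A one dimensional toy case illustrates the bound (3.1). Take $A = \{0, 2\} \subset \mathbb{R}$ with the usual absolute value, which is approximately convex in the sense that every midpoint of two points of $A$ lies within distance $1$ of $A$. The Python sketch below checks $\operatorname{dist}(b, A) \le E(\alpha_0, \alpha_1)$ for dyadic weights; the set $A$, the helper names `H`, `dist_to_A` and `bound_holds`, and the grid are assumptions made only for this example.

```python
from fractions import Fraction


def H(x):
    """Exact H at a dyadic rational x via the finite sum of frac(2^k x) / 2^k."""
    y = Fraction(x)
    d = y.denominator
    assert d & (d - 1) == 0, "x must be a dyadic rational"
    total = Fraction(0)
    k = 0
    while y.denominator != 1:
        total += (y - y.numerator // y.denominator) / 2**k
        y *= 2
        k += 1
    return total


A = (Fraction(0), Fraction(2))


def dist_to_A(b):
    """Distance from the real number b to the finite set A."""
    return min(abs(b - a) for a in A)


# A is approximately convex: midpoints of points of A are within 1 of A.
approx_convex = all(dist_to_A((a + b) / 2) <= 1 for a in A for b in A)


def bound_holds(alpha1):
    """Check dist(b, A) <= E(alpha0, alpha1) for b = alpha0 * 0 + alpha1 * 2."""
    alpha0 = 1 - alpha1
    b = alpha0 * A[0] + alpha1 * A[1]
    return dist_to_A(b) <= H(alpha0) + H(alpha1)


all_ok = all(bound_holds(Fraction(i, 64)) for i in range(65))
print(approx_convex, all_ok)
```

At $\alpha = (1/2, 1/2)$ both sides equal $1$, so in this toy case the first inequality of (3.1) is attained.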



Recall that a subset $A \subset \mathbb{R}^n$ is convexly connected iff there is no hyperplane $H$ of $\mathbb{R}^n$ so that $A$ meets both half spaces determined by $H$ but does not meet $H$. Each subset $A$ decomposes uniquely into convexly connected components.

3.3. Theorem. Let $\|\cdot\|$ be a norm on $\mathbb{R}^n$ and let $A \subset \mathbb{R}^n$ be a set which is approximately convex in this norm. Assume that either $A$ has at most $n$ connected components or $A$ is compact and has at most $n$ convexly connected components. Then any $b \in \operatorname{Co}(A)$ satisfies $\operatorname{dist}(b, A) \le \kappa(n-1)$.

Proof. In either of the two cases there is a refinement of Carathéodory's Theorem which implies that $b$ is a convex combination of $n$ points $a_0, \dots, a_{n-1}$ of $A$. Then Theorem 3.1 with $m = n - 1$ implies $\operatorname{dist}(b, A) \le \kappa(n-1)$. □

In a normed space we will use the notation $B_R(x_0)$ for the closed ball of radius $R$ about $x_0$.

3.4. Proposition. Let $\|\cdot\|$ be a norm on $\mathbb{R}^n$ and $A \subset \mathbb{R}^n$ a closed subset of $\mathbb{R}^n$. Assume that $x_0 \in \mathbb{R}^n \setminus A$ is a point where the function $x \mapsto \operatorname{dist}(x, A)$ has a local maximum. Set $R := \operatorname{dist}(x_0, A)$ and let $A_1 := B_R(x_0) \cap A$ be the points of $A$ at a distance $R$ from $x_0$. Then there are points $a_0, \dots, a_k \in A_1$ with $k \le n$ and norm one linear functionals $\lambda_0, \dots, \lambda_k \in \mathbb{R}^{n*}$ so that $\lambda_i(a_i - x_0) = R$ (i.e. $\lambda_i$ norms $a_i - x_0$) and with $0 \in \operatorname{Co}\{\lambda_0, \dots, \lambda_k\}$.

Proof. By translation and rescaling we may assume $x_0 = 0$ and $R = 1$. Let $S := \{u \in \mathbb{R}^n : \|u\| = 1\}$ be the unit sphere of the norm $\|\cdot\|$. Let $\|\cdot\|^*$ be the dual norm on $\mathbb{R}^{n*}$ and $S^*$ the unit sphere of $\|\cdot\|^*$. For any subset $C \subset \mathbb{R}^n$ let $N^*(C)$ be the set of linear functionals that norm some member of $C$. Explicitly $N^*(C) := \{\lambda \in S^* : \lambda(c) = \|c\| \text{ for some } c \in C\}$. If $C$ is compact then $N^*(C)$ is also compact. (For if $(\lambda_\ell)_{\ell=1}^{\infty}$ is a sequence from $N^*(C)$ then (as $S^*$ is compact) by going to a subsequence we can assume that $\lambda_\ell \to \lambda$ for some $\lambda \in S^*$. For each $\ell$ there is a $c_\ell \in C$ with $\lambda_\ell(c_\ell) = \|c_\ell\|$. By compactness of $C$ and again going to a subsequence we assume $c_\ell \to c$ for some $c \in C$. But then $\lambda(c) = \lim_{\ell\to\infty} \lambda_\ell(c_\ell) = \lim_{\ell\to\infty} \|c_\ell\| = \|c\|$ which shows $\lambda \in N^*(C)$. Thus any sequence from $N^*(C)$ contains a subsequence that converges to a point of $N^*(C)$ and therefore $N^*(C)$ is compact.)

Let $d_H(\cdot, \cdot)$ be the Hausdorff distance defined on the compact subsets of $\mathbb{R}^n$. View the map $C \mapsto N^*(C)$ as a map from the set of compact subsets of $\mathbb{R}^n$ to the set of compact subsets of $S^*$. Then we claim this map is sub-continuous in the sense that if $d_H(C_\ell, C) \to 0$ and $K \subset S^*$ is a cluster point of the sequence $(N^*(C_\ell))_{\ell=1}^{\infty}$ then $K \subseteq N^*(C)$. To see this note as $K$ is a cluster point of $(N^*(C_\ell))_{\ell=1}^{\infty}$ by going to a subsequence we can assume $N^*(C_\ell) \to K$. Choose $\lambda \in K$. Then we can choose $\lambda_\ell \in N^*(C_\ell)$ in such a way that $\lambda_\ell \to \lambda$. From the definition of $N^*(C_\ell)$ there is a $c_\ell \in C_\ell$ so that $\lambda_\ell(c_\ell) = \|c_\ell\|$. By yet again going to a subsequence it can be assumed $c_\ell \to c$ for some $c \in C$. But then a calculation like the one showing $N^*(C)$ is compact yields $\lambda(c) = \|c\|$. Thus $\lambda \in N^*(C)$. As $\lambda$ was any element of $K$ this shows $K \subseteq N^*(C)$ as claimed.

We now return to the proof of Proposition 3.4. For $r \ge 1$ let $A_r := \{a \in A : \|a\| \le r\}$. Then, as in the statement of the proposition, $A_1$ is the set of points of $A$ at a distance exactly $1$ from $0$ and so the conclusion of the proposition is equivalent to $0 \in \operatorname{Co}(N^*(A_1))$ (for if $0$ is a convex combination of elements of $N^*(A_1)$ then the number of elements can be reduced to $n+1$ by Carathéodory's Theorem). Assume, toward a contradiction, that $0 \notin \operatorname{Co}(N^*(A_1))$. Then $N^*(A_1)$ is compact and thus $\operatorname{Co}(N^*(A_1))$ is also compact. Therefore the distance from $\operatorname{Co}(N^*(A_1))$ to $0$ is positive, say $2\delta$. As $1 \le r \le s$ implies $A_1 \subseteq A_r \subseteq A_s$ and $\bigcap_{r>1} A_r = A_1$ it is not hard to see that $\lim_{r\searrow 1} d_H(A_r, A_1) = 0$. Thus by the sub-continuity of $N^*$ there is an $r_0 > 1$ so that the set $N^*(A_{r_0})$ has Hausdorff distance $< \delta$ from some subset $K$ of $N^*(A_1)$. This implies the Hausdorff distance between $\operatorname{Co}(N^*(A_{r_0}))$ and $\operatorname{Co}(K)$ is $< \delta$ and as $K \subseteq N^*(A_1)$ this implies $\operatorname{dist}(0, \operatorname{Co}(N^*(A_{r_0}))) > \delta$. Thus there is a linear functional on $\mathbb{R}^{n*}$ that separates $N^*(A_{r_0})$ from $0$. As the linear functionals on $\mathbb{R}^{n*}$ are the point evaluations there is a unit vector $u_0 \in S$ and $\varepsilon > 0$ so that for all $\lambda \in N^*(A_{r_0})$ the inequality $\lambda(u_0) \le -\varepsilon$ holds. Therefore for any $b \in A_{r_0}$ we have a $\lambda \in N^*(A_{r_0})$ that norms $b$ and so for all $t \ge 0$

$$\|b - tu_0\| \ge \lambda(b - tu_0) = \|b\| - t\lambda(u_0) \ge 1 + \varepsilon t$$

and so $\operatorname{dist}(tu_0, A_{r_0}) \ge 1 + \varepsilon t$ for all $t \ge 0$. Suppose that $\|x\| < (r_0 - 1)/2$. Then $\operatorname{dist}(x, A) \le \operatorname{dist}(0, A) + \|x\| < 1 + (r_0 - 1)/2 = (r_0 + 1)/2$. Suppose that $a \in A$ and that $\|a\| > r_0$. Then $\|a - x\| \ge \|a\| - \|x\| > (1 + r_0)/2 > \operatorname{dist}(x, A)$. Thus $\operatorname{dist}(x, A) = \operatorname{dist}(x, A_{r_0})$. In particular this implies that for $0 < t < (r_0 - 1)/2$ we have $\operatorname{dist}(tu_0, A) = \operatorname{dist}(tu_0, A_{r_0}) \ge 1 + \varepsilon t > 1$. This contradicts that $\operatorname{dist}(\cdot, A)$ has a local maximum at $x = 0$ and completes the proof. □



3.2. General lower bounds. The following result shows that the estimate of Theorem 3.1 is sharp for all $m \le n-1$ and that Theorem 3.3, Theorem 3.7 and Theorem [?] are all sharp.

3.5. Theorem. Let $\|\cdot\|$ be any norm on $\mathbb{R}^n$ with $n \ge 2$ and let $\alpha = (\alpha_0, \ldots, \alpha_{n-1}) \in \Delta_{n-1}$. Then, for any $\varepsilon > 0$, there is a compact connected approximately convex set $A \subseteq \mathbb{R}^n$ and a point $b \in \operatorname{Co}(A)$ so that $b = \sum_{k=0}^{n-1} \alpha_k a_k$, with $a_k \in A$, so that $\operatorname{dist}(b, A) \ge E(\alpha_0, \ldots, \alpha_{n-1}) - \varepsilon$. In particular, since $\sup_{x \in \Delta_{n-1}} E(x) = \kappa(n-1)$ (cf. 2.2), for the proper choice of $\alpha$ it follows that there is a compact connected approximately convex set $A \subseteq \mathbb{R}^n$ and a point $b \in \operatorname{Co}(A)$ so that $\operatorname{dist}(b, A) \ge \kappa(n-1) - \varepsilon$. (In the terminology of Theorem 1 this implies that $C_{\|\cdot\|} \ge \kappa(n-1)$.)

Proof. Let $\|\cdot\|$ be any norm on $\mathbb{R}^n$ and let $\lambda \in \mathbb{R}^{n*}$ be a linear functional on $\mathbb{R}^n$ with $\|\lambda\| = 1$. Let $u \in \mathbb{R}^n$ be a vector with $\|u\| = 1 = \lambda(u)$. Let $S := \{x \in \mathbb{R}^n : \lambda(x) = 0\}$ be the null space of $\lambda$. Choose $n$ points $a_0, \ldots, a_{n-1}$ in $S$ that are affinely independent. For each $M > 0$ define

$$V_M := \operatorname{Co}\{M a_0, \ldots, M a_{n-1}\}.$$

Any point of $V_M$ is uniquely of the form $\sum_{k=0}^{n-1} x_k M a_k$ for some $\sum_{k=0}^{n-1} x_k e_k \in \Delta_{n-1}$. Define $F_M$ on $V_M$ by

$$F_M\Big(\sum_{k=0}^{n-1} x_k M a_k\Big) := E\Big(\sum_{k=0}^{n-1} x_k e_k\Big).$$

Finally set

$$A_M := \{x + yu : x \in V_M,\ F_M(x) \le y \le \kappa(n-1) + 1\}.$$

Since $E$ is lower semi-continuous, $F_M$ is also lower semi-continuous. This implies $A_M$ is closed and bounded. (To see that $A_M$ is closed: $x_\ell + y_\ell u \in A_M$ and $x_\ell + y_\ell u \to x + yu$ imply $x_\ell \to x$, $y_\ell \to y$ and $F_M(x) \le \liminf_{\ell\to\infty} F_M(x_\ell) \le \lim_{\ell\to\infty} y_\ell = y$, and so $x + yu \in A_M$.) It is also easy to check that $A_M$ is connected (and in fact contractible). That $A_M$ is an approximately convex set follows from $E$ being an approximately convex function.
Let $\varepsilon > 0$ and define $\varphi_M : \Delta_{n-1} \to V_M$ by

$$\varphi_M\Big(\sum_{k=0}^{n-1} x_k e_k\Big) = \sum_{k=0}^{n-1} x_k M a_k.$$

Then $F_M \circ \varphi_M = E$. Fix a norm $\|\cdot\|_0$ on $\mathbb{R}^n$. Then there is a constant $C > 0$ so that

$$\|\varphi_M(x) - \varphi_M(y)\| \ge CM\|x - y\|_0 \quad \text{for all } x, y \in \Delta_{n-1}.$$

($C$ will depend on $\|\cdot\|_0$.) Since $E$ is lower semi-continuous, $U := \{x \in \Delta_{n-1} : E(x) > E(\alpha) - \varepsilon\}$ is open in $\Delta_{n-1}$ and thus there is an $R > 0$ so that $B_R(\alpha) \cap \Delta_{n-1} \subseteq U$. As $E(e_k) = 0$ we have $\varphi_M(e_k) = M a_k \in A_M$ for $0 \le k \le n-1$. Let $b := \varphi_M(\alpha) = \sum_{k=0}^{n-1} \alpha_k \varphi_M(e_k)$. If $w \in A_M$ then $w = z + \beta u$ where $z \in V_M$ and $F_M(z) \le \beta \le \kappa(n-1) + 1$. If $\|z - b\| < MCR$, then $F_M(z) > E(\alpha) - \varepsilon$, and since $\lambda(z - b) = 0$ we get $\|z + \beta u - b\| \ge \lambda(z + \beta u - b) = \lambda(\beta u) = \beta \ge F_M(z) > E(\alpha) - \varepsilon$. If $\|z - b\| \ge MCR$, then

$$\|z + \beta u - b\| \ge \|z - b\| - \beta \ge MCR - \kappa(n-1) - 1.$$

Now choose $M$ so that $MCR \ge 2\kappa(n-1) + 1$, so that $MCR - \kappa(n-1) - 1 \ge \kappa(n-1) \ge E(\alpha) - \varepsilon$. Then $\|z + \beta u - b\| \ge E(\alpha) - \varepsilon$ for all $z + \beta u \in A_M$ and so $\operatorname{dist}(b, A_M) \ge E(\alpha) - \varepsilon$. This completes the proof. □

In the terminology of the last proof define a function $h_M : \Delta_{n-1} \to [0, \infty)$ by $h_M(x) := \operatorname{dist}(\varphi_M(x), A_M)$. Then $h_M$ is approximately convex and $h_M$ vanishes on the vertices of $\Delta_{n-1}$. Also $h_M$ is continuous and in fact Lipschitz continuous. The proof shows that for each fixed $\alpha \in \Delta_{n-1}$ we have $\lim_{M\to\infty} h_M(\alpha) = E(\alpha)$. Replacing $n-1$ by $n$ we thus have:

3.6. Proposition. There is a sequence of Lipschitz continuous approximately convex functions $(h_\ell)_{\ell=1}^\infty$ on $\Delta_n$ vanishing on the vertices of $\Delta_n$ such that $\lim_{\ell\to\infty} h_\ell(x) = \sup_{\ell \ge 1} h_\ell(x) = E(x)$ for all $x \in \Delta_n$. □



3.3. The sharp bounds in Euclidean Space. Theorem 3.1 can be im- 
proved in Euclidean spaces. 

3.7. Theorem. Let $\mathbb{R}^n$ have its usual inner product norm and let $A \subseteq \mathbb{R}^n$ be approximately convex. Then any point $b \in \operatorname{Co}(A)$ has $\operatorname{dist}(b, A) \le \kappa(n-1)$. (When combined with Theorem 3.5 and using the terminology of Theorem 1 this implies $C_{\|\cdot\|} = \kappa(n-1)$ in Euclidean spaces of all dimensions.)



We will denote the usual inner product on $\mathbb{R}^n$ by $\langle \cdot, \cdot \rangle$. Let $S^{n-1}$ be the unit sphere in $\mathbb{R}^n$ with the Euclidean norm. Set

$$S(n) := \{A \subseteq S^{n-1} : \#(A) = n+1 \text{ and } 0 \in \operatorname{Co}(A)^\circ\}, \tag{3.2}$$

so that $S(n)$ can be thought of as the set of simplexes inscribed in the sphere that have the origin in their interior. An $n$-dimensional simplex that has all its edge lengths equal is a regular simplex. Recall that any two regular simplices with the same edge lengths are congruent. We leave the following calculations to the reader.

3.8. Proposition. Let $A \subseteq S^{n-1}$ be the set of vertices of a regular $n$-dimensional simplex (so that $\#(A) = n+1$) inscribed in the sphere. Then $A \in S(n)$ and the edge length of $A$ is given by

$$\|a - b\| = \sqrt{\frac{2(n+1)}{n}}$$

($a, b \in A$ and $a \ne b$). Moreover the distance of the midpoint of the segment between $a$ and $b$ to the origin is

$$\left\|\frac{a+b}{2}\right\| = \sqrt{\frac{n-1}{2n}}.$$

□
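The calculation left to the reader in Proposition 3.8 can be checked numerically. The following is a minimal sketch, not part of the original argument: it builds a regular simplex from the standard basis of $\mathbb{R}^{n+1}$ (an assumed, convenient model, since the centred basis vectors lie in an $n$-dimensional hyperplane) and compares the edge length and midpoint distance with the two closed forms. The function names are hypothetical.

```python
import math


def norm(v):
    return math.sqrt(sum(x * x for x in v))


def regular_simplex(n):
    """Return n + 1 unit vectors forming a regular simplex.

    The vertices live in the hyperplane of R^(n+1) orthogonal to (1, ..., 1),
    which is an isometric copy of R^n, so they lie on a copy of S^(n-1).
    """
    vertices = []
    for i in range(n + 1):
        # Standard basis vector minus the centroid (1/(n+1), ..., 1/(n+1)).
        v = [(1.0 if j == i else 0.0) - 1.0 / (n + 1) for j in range(n + 1)]
        length = norm(v)
        vertices.append([x / length for x in v])
    return vertices


def edge_and_midpoint(n):
    """Return (edge length, distance of an edge midpoint to the origin)."""
    a, b = regular_simplex(n)[:2]
    edge = norm([x - y for x, y in zip(a, b)])
    mid = norm([(x + y) / 2 for x, y in zip(a, b)])
    return edge, mid


for n in range(1, 7):
    edge, mid = edge_and_midpoint(n)
    assert math.isclose(edge, math.sqrt(2 * (n + 1) / n))
    assert math.isclose(mid, math.sqrt((n - 1) / (2 * n)), abs_tol=1e-12)
```

By symmetry every pair of vertices gives the same two numbers, so checking a single pair is enough.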



Define $\mathcal{M} : S(n) \to [0, 2]$ by

$$\mathcal{M}(A) := \max_{a, b \in A} \|a - b\|.$$

Then $\mathcal{M}(A)$ is the length of the longest edge of the simplex with vertices $A$. The following characterizes the regular simplexes in terms of minimizing $\mathcal{M}$ on $S(n)$.

3.9. Theorem. Let $A \in S(n)$. Then

$$\mathcal{M}(A) \ge \sqrt{\frac{2(n+1)}{n}}$$

with equality if and only if $A$ is the set of vertices of a regular simplex.
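For $n = 2$ the statement says that a triangle inscribed in the unit circle with the origin in its interior has a longest edge of length at least $\sqrt{2(n+1)/n} = \sqrt{3}$, the edge length of the regular simplex from Proposition 3.8. The sketch below is only a random sanity check of that special case, not a proof; the sampling scheme and function names are assumptions made for illustration.

```python
import math
import random


def contains_origin_in_interior(angles):
    """For three points on the unit circle, the origin is interior to the
    triangle exactly when every arc between consecutive points is < pi."""
    s = sorted(angles)
    arcs = [s[1] - s[0], s[2] - s[1], 2 * math.pi - (s[2] - s[0])]
    return all(0 < arc < math.pi for arc in arcs)


def longest_edge(angles):
    """Length of the longest edge of the inscribed triangle."""
    pts = [(math.cos(t), math.sin(t)) for t in angles]
    return max(math.dist(p, q) for p in pts for q in pts)


def min_longest_edge(samples, seed=0):
    """Smallest longest-edge value seen over random admissible triangles."""
    rng = random.Random(seed)
    best = float("inf")
    for _ in range(samples):
        angles = [rng.uniform(0, 2 * math.pi) for _ in range(3)]
        if contains_origin_in_interior(angles):
            best = min(best, longest_edge(angles))
    return best


bound = math.sqrt(2 * (2 + 1) / 2)  # sqrt(3), the n = 2 case of the bound
assert min_longest_edge(20000) >= bound - 1e-12
# The equilateral triangle attains the bound.
assert math.isclose(longest_edge([0, 2 * math.pi / 3, 4 * math.pi / 3]), bound)
```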

3.10. Lemma. Let $A = \{x_0, \ldots, x_n\} \in S(n)$ and assume that

$$\|x_0 - x_{n-1}\| < \|x_0 - x_n\|. \tag{3.3}$$

Then there is a point $x_0^* \in S^{n-1}$ so that

$$\{x_0^*, x_1, x_2, \ldots, x_n\} \in S(n), \tag{3.4}$$

$$\|x_0^* - x_n\| < \|x_0 - x_n\|, \tag{3.5}$$

$$\|x_0^* - x_{n-1}\| < \|x_0 - x_n\|, \tag{3.6}$$

$$\|x_0^* - x_i\| = \|x_0 - x_i\|, \quad 1 \le i \le n-2. \tag{3.7}$$

Proof. Since $0$ is in the interior of $\operatorname{Co}(A)$, any subset of $A$ of size $n$ will be linearly independent. For $1 \le i \le n$ define $f_i : \mathbb{R}^n \to [0, \infty)$ and $\rho_i : S^{n-1} \to \mathbb{R}$ by

$$f_i(x) := \|x - x_i\|, \qquad \rho_i(x) := \|x - x_i\|.$$

($\rho_i$ is the restriction of $f_i$ to $S^{n-1}$.) Let $\nabla f_i$ be the usual gradient of $f_i$ and $\nabla \rho_i$ the gradient of $\rho_i$ as a function on $S^{n-1}$. (That is, $\nabla \rho_i$ is the vector field tangent to $S^{n-1}$ so that for smooth curves $c(t)$ in $S^{n-1}$ the equality $\frac{d}{dt}\rho_i(c(t)) = \langle c'(t), \nabla\rho_i(c(t)) \rangle$ holds.) Then a standard calculation gives

$$\nabla f_i(x) = \frac{x - x_i}{\|x - x_i\|}.$$

As $\rho_i$ is the restriction of $f_i$ to $S^{n-1}$, the vector field $\nabla\rho_i(x)$ is the orthogonal projection of $\nabla f_i(x)$ onto the tangent space $T(S^{n-1})_x$ to $S^{n-1}$ at $x$. Therefore

$$\nabla\rho_i(x_0) = \frac{x_0 - x_i}{\|x_0 - x_i\|} - \left\langle \frac{x_0 - x_i}{\|x_0 - x_i\|}, x_0 \right\rangle x_0.$$

But then the $n-1$ vectors $\nabla\rho_1(x_0), \ldots, \nabla\rho_{n-2}(x_0), \nabla\rho_n(x_0)$ are linearly independent, as any nontrivial linear relationship between them would lead to a nontrivial linear relationship between $x_0, \ldots, x_{n-2}, x_n$, which are linearly independent. The implicit function theorem implies that the $n-1$ functions $\rho_1, \ldots, \rho_{n-2}, \rho_n$ are local coordinates on $S^{n-1}$ near $x_0$ (that is, the map $x \mapsto (\rho_1(x), \ldots, \rho_{n-2}(x), \rho_n(x))$ is a diffeomorphism onto an open set in $\mathbb{R}^{n-1}$ when restricted to a small enough open neighborhood of $x_0$). Let $\delta_i = \rho_i(x_0) = \|x_0 - x_i\|$ and set

$$N := \{x \in S^{n-1} : \rho_i(x) = \delta_i,\ 1 \le i \le n-2\} = \{x \in S^{n-1} : \|x - x_i\| = \delta_i,\ 1 \le i \le n-2\}.$$

As $\rho_1, \ldots, \rho_{n-2}, \rho_n$ are local coordinates near $x_0$, this will be a smooth curve in $S^{n-1}$ near $x_0$ and, moreover, any point $x_0^* \in N$ will satisfy all the conditions (3.7). Choose a parameterization $c : (-\varepsilon, \varepsilon) \to N$ of $N$ near $x_0$ with $c(0) = x_0$. As $\rho_1, \ldots, \rho_{n-2}, \rho_n$ is a local coordinate system near $x_0$ and the first $n-2$ of these functions are constant on $c(t)$, we have that $\frac{d}{dt}\rho_n(c(t))\big|_{t=0} = \langle \nabla\rho_n(x_0), c'(0) \rangle \ne 0$. Without loss of generality we can assume that $\frac{d}{dt}\rho_n(c(t))\big|_{t=0} < 0$ (otherwise replace $c(t)$ by $c(-t)$). Then for small $t > 0$ we have $\rho_n(c(t)) < \rho_n(c(0)) = \rho_n(x_0)$, which is (3.5). Also the conditions (3.4) and (3.6) are open conditions in $x_0^*$ and so for any $t$ sufficiently close to $0$ they will hold for $x_0^* = c(t)$. Therefore $x_0^* = c(t)$ for small positive $t$ satisfies the conclusion of the lemma. This completes the proof. □



Proof of Theorem 3.9. We prove the theorem by induction on $n$. The base case of $n = 1$ is trivial. Let $\overline{S}(n)$ be the closure of $S(n)$, that is

$$\overline{S}(n) = \{A \subseteq S^{n-1} : \#(A) \le n+1,\ 0 \in \operatorname{Co}(A)\}.$$

Then the function $\mathcal{M}(A) = \max_{a,b \in A} \|a - b\|$ is continuous on $\overline{S}(n)$ and $\overline{S}(n)$ is compact, so $\mathcal{M}$ attains its minimum at some $A_0 \in \overline{S}(n)$. If this minimum occurs at a boundary point of $\overline{S}(n)$ then $0 \in \operatorname{Co}(A_0)$, but $0 \notin \operatorname{Co}(A_0)^\circ$. Let $a, b \in A_0$ be the points of $A_0$ so that $\|a - b\| = \mathcal{M}(A_0)$. Then there exists a subset $\{a, b\} \subseteq A_1 \subseteq A_0$ so that $\#(A_1) =: m+1 < n+1$ with $A_1$ affinely independent and so that $0$ is in the relative interior of $\operatorname{Co}(A_1)$. Thus, with obvious notation, $A_1 \in S(m)$ and therefore by the induction hypothesis $\mathcal{M}(A_1) \ge \sqrt{2(m+1)/m}$. But for the regular simplex in $S(n)$ the function $\mathcal{M}$ has the value $\sqrt{2(n+1)/n}$, which is less than $\sqrt{2(m+1)/m}$. Therefore the minimum of $\mathcal{M}$ on $\overline{S}(n)$ occurs in $S(n)$.

Again, let $A_0 \in S(n)$ be where $\mathcal{M}$ attains its minimum, and let $c = \mathcal{M}(A_0)$. If every edge of $A_0$ has length $c$ then $A_0$ is a regular simplex and we are done. The number of edges of $A_0$ is $\binom{n+1}{2}$. So assume that there are $k < \binom{n+1}{2}$ edges that have length $c$. Then there will be a side $\{x_0, x_n\}$ of length $c$ that has a vertex in common with a side $\{x_0, x_{n-1}\}$ that has length less than $c$. With this notation let $A_0 = \{x_0, \ldots, x_n\}$. Then by Lemma 3.10 we can replace $x_0$ by some $x_0^*$ so that if $A_1 := \{x_0^*, x_1, \ldots, x_n\}$ then both the edges $\{x_0^*, x_n\}$ and $\{x_0^*, x_{n-1}\}$ have length $< c$ and all of the other $\binom{n+1}{2} - 2$ edge lengths stay the same. Therefore $A_1$ has only $k-1$ edges of length $c$ (and if $k = 1$ then all edges of $A_1$ have length less than $c$). By repeating this procedure $k$ times we end up with $A_k \in S(n)$ so that $\mathcal{M}(A_k) < \mathcal{M}(A_0)$, contrary to the assumption that $A_0$ was the minimizer. Thus the minimizer must be regular. This completes the proof. □
If $x, y \in S^{n-1}$ then $\|x + y\|^2 + \|x - y\|^2 = 4$. Whence the distance $\left\|\frac{x+y}{2}\right\|$ of the midpoint of the segment $xy$ from the origin is determined by its length. Therefore Theorem 3.9 implies the following:

3.11. Corollary. Let $S(n)$ be defined by (3.2) above and let $\mathcal{D} : S(n) \to [0, 2]$ be given by

$$\mathcal{D}(A) = \min_{a, b \in A} \left\|\frac{a+b}{2}\right\|.$$

Then for all $A \in S(n)$ the inequality

$$\mathcal{D}(A) \le \sqrt{\frac{n-1}{2n}}$$

holds. Equality holds if and only if $A$ is the set of vertices of a regular simplex. □
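The passage from the longest-edge bound to the midpoint bound is a one-line computation. As a sketch, writing $\mathcal{M}(A)$ for the longest edge length and $\mathcal{D}(A)$ for the smallest midpoint distance, and using the lower bound $\sqrt{2(n+1)/n}$ for $\mathcal{M}(A)$ (the edge length of the regular simplex in Proposition 3.8):

```latex
% Parallelogram identity on the unit sphere: \|a\| = \|b\| = 1.
\left\| \frac{a+b}{2} \right\|^{2} = 1 - \frac{\|a-b\|^{2}}{4}.
% Choose a, b \in A with \|a-b\| = \mathcal{M}(A) \ge \sqrt{2(n+1)/n}:
\mathcal{D}(A)^{2} \le 1 - \frac{\mathcal{M}(A)^{2}}{4}
  \le 1 - \frac{2(n+1)}{4n} = \frac{n-1}{2n}.
```

Equality throughout forces $\mathcal{M}(A) = \sqrt{2(n+1)/n}$, which is the regular case.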

The following is what is needed in the proof of our main results. 

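3.12. Proposition. Let $B_r(x_0)$ be a ball of radius $r$ in $\mathbb{R}^n$ with the Euclidean norm and assume that there are $n+1$ points $\{a_0, \ldots, a_n\} \subseteq \partial B_r(x_0)$ such that $x_0$ is in the interior of the simplex $\operatorname{Co}\{a_0, \ldots, a_n\}$. Assume that for each pair $\{a_i, a_j\}$ the distance of the midpoint $(a_i + a_j)/2$ to $\partial B_r(x_0)$ is $\le 1$. Then

$$r \le \frac{\sqrt{2n}\,(\sqrt{2n} + \sqrt{n-1})}{n+1} \le \kappa(n-1).$$

Proof. By translating we may assume $x_0 = 0$. By Corollary 3.11 (applied after scaling by $1/r$) there exists a pair $\{a_i, a_j\}$ such that $\|a_i + a_j\|/2 \le r\sqrt{(n-1)/(2n)}$. So

$$1 \ge \operatorname{dist}\Big(\frac{a_i + a_j}{2}, \partial B_r(x_0)\Big) \ge r - \frac{\|a_i + a_j\|}{2} \ge r\left(1 - \sqrt{\frac{n-1}{2n}}\right).$$

Solving for $r$ gives $r \le \sqrt{2n}(\sqrt{2n} + \sqrt{n-1})/(n+1)$. Write $r(n)$ for this bound. To see that $r(n) \le \kappa(n-1)$ first note $r(n) < 2 + \sqrt{2} < 3.42$ for $n \ge 1$. If $n \ge 4$ we then have $r(n) < 3.5 = \kappa(4-1) \le \kappa(n-1)$. This only leaves $r(2) = 2 = \kappa(2-1)$ and $r(3) = \sqrt{3}(\sqrt{3} + 1)/2 < 3 = \kappa(3-1)$. □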
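The case analysis at the end of this proof can be checked numerically. The sketch below assumes the closed form $\kappa(m) = \lfloor \log_2 m \rfloor + 1 + m / 2^{\lfloor \log_2 m \rfloor}$, which is how we read the bound stated in the abstract; it agrees with the values $\kappa(1) = 2$, $\kappa(2) = 3$ and $\kappa(3) = 3.5$ used in the proof. The function names `kappa` and `r_bound` are hypothetical.

```python
import math


def kappa(m):
    """Assumed closed form: floor(log2 m) + 1 + m / 2**floor(log2 m)."""
    p = m.bit_length() - 1  # floor(log2(m)) for a positive integer m
    return p + 1 + m / 2 ** p


def r_bound(n):
    """Bound on r obtained by solving 1 >= r * (1 - sqrt((n - 1) / (2n)))."""
    return math.sqrt(2 * n) * (math.sqrt(2 * n) + math.sqrt(n - 1)) / (n + 1)


for n in range(2, 2000):
    assert r_bound(n) < 2 + math.sqrt(2)
    assert r_bound(n) <= kappa(n - 1) + 1e-12

# The small cases singled out in the proof.
assert math.isclose(r_bound(2), 2.0) and kappa(1) == 2.0
assert math.isclose(r_bound(3), math.sqrt(3) * (math.sqrt(3) + 1) / 2)
assert kappa(2) == 3.0 and kappa(3) == 3.5
```

Only $n = 2$ gives equality in this range, which matches the two-dimensional example in Remark 3.13.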



Proof of Theorem 3.7. By replacing $A$ by its closure we can assume that $A$ is closed. Define $f : \mathbb{R}^n \to \mathbb{R}$ by $f(x) := \operatorname{dist}(x, A)$. By Carathéodory's Theorem, it suffices to prove that if $\{a_0, \ldots, a_n\} \subseteq A$ then $f(x) \le \kappa(n-1)$ for all $x \in \operatorname{Co}(\{a_0, \ldots, a_n\})$. To simplify notation set $\Delta := \operatorname{Co}(\{a_0, \ldots, a_n\})$ and let $x_0$ be the point where $f|_\Delta$ achieves its maximum. Then we wish to show $f(x_0) \le \kappa(n-1)$. If $x_0$ is on the boundary (or if $\{a_0, \ldots, a_n\}$ is not affinely independent) then $x_0$ is a convex combination of $\le n$ points of $\{a_0, \ldots, a_n\}$ and so $f(x_0) \le \kappa(n-1)$ by Theorem 2.3.

This leaves the case where $x_0$ is in the interior of $\Delta$. Then $f(\cdot) = \operatorname{dist}(\cdot, A)$ has a local maximum at the interior point $x_0$ of $\operatorname{Co}(A)$. Let $R := f(x_0)$. Then, by Proposition 3.4, there are points $a_0, \ldots, a_k \in A \cap B_R(x_0)$ so that $\{a_0, \ldots, a_k\}$ is an affinely independent set and there are unit vectors $u_0, \ldots, u_k$ so that the functional $\lambda_i := \langle \cdot, u_i \rangle$ norms $a_i - x_0$ and $0 \in \operatorname{Co}\{u_0, \ldots, u_k\}$. But if $\lambda_i$ norms $a_i - x_0$ then $u_i = (a_i - x_0)/\|a_i - x_0\|$. Therefore $0 \in \operatorname{Co}\{u_0, \ldots, u_k\}$ implies $x_0 \in \operatorname{Co}\{a_0, \ldots, a_k\}$. Now Proposition 3.12 implies $f(x_0) = R \le \kappa(n-1)$. This completes the proof. □



3.13. Remark. Let $A$ be the seven point subset of the Euclidean plane shown in Figure 3. Then $A$ is approximately convex and satisfies $d_H(\operatorname{Co}(A), A) = 2 = \kappa(1)$. In higher dimensions we do not know if there exist such examples of $A \subseteq \mathbb{R}^n$ with $d_H(\operatorname{Co}(A), A) = \kappa(n-1)$.







[Figure 3 shows the seven points $(-3\sqrt{3}, 0)$, $(-2\sqrt{3}, 1)$, $(-\sqrt{3}, 2)$, $(0, 1)$, $(\sqrt{3}, 2)$, $(2\sqrt{3}, 1)$, $(3\sqrt{3}, 0)$.]

FIGURE 3. A two-dimensional Euclidean example.
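The two claims of Remark 3.13 can be partly checked by machine. The sketch below takes the seven coordinates as we read them from the figure (the signs of the point $(-\sqrt{3}, 2)$ are an assumption, since the extracted label is damaged), verifies that every midpoint of two points of $A$ is within distance 1 of $A$, and exhibits one point of $\operatorname{Co}(A)$ at distance exactly 2 from $A$. It therefore confirms only $d_H(\operatorname{Co}(A), A) \ge 2$; the matching upper bound is Theorem 3.7.

```python
import itertools
import math

S3 = math.sqrt(3)
# Coordinates as read from Figure 3 (signs of the third point are an assumption).
A = [(-3 * S3, 0), (-2 * S3, 1), (-S3, 2), (0, 1), (S3, 2), (2 * S3, 1), (3 * S3, 0)]


def dist_to_set(p, points):
    """Euclidean distance from the point p to the finite set points."""
    return min(math.dist(p, q) for q in points)


def is_approximately_convex(points, tol=1e-9):
    """Check that every midpoint of two points is within distance 1 of the set."""
    for p, q in itertools.combinations(points, 2):
        mid = ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
        if dist_to_set(mid, points) > 1 + tol:
            return False
    return True


assert is_approximately_convex(A)
# (-sqrt(3), 0) lies on the segment joining the two extreme points of A,
# hence in Co(A), and is at distance exactly 2 from A.
assert math.isclose(dist_to_set((-S3, 0), A), 2.0)
```

Several midpoints lie at distance exactly 1 from $A$, which is why the check uses a small tolerance.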



3.4. The sharp two dimensional bounds. We now give the sharp esti- 
mate for the size of a convex hull in all two dimensional normed spaces. 

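3.14. Theorem. Suppose $\|\cdot\|$ is a norm on $X = \mathbb{R}^2$ and that $A \subseteq X$ is approximately convex in this norm. Then any point $b \in \operatorname{Co}(A)$ has $\operatorname{dist}(b, A) \le 2$. (By Theorem 3.5, given $\varepsilon > 0$ there exist an approximately convex $A_\varepsilon \subseteq \mathbb{R}^2$ and a $b \in \operatorname{Co}(A_\varepsilon)$ so that $\operatorname{dist}(b, A_\varepsilon) \ge 2 - \varepsilon$, and thus in the notation of Theorem 1 $C_{\|\cdot\|} = 2$ for all two dimensional norms.)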

3.15. Lemma. Let $V = \{a, b, c, -a, -b, -c\}$ be the vertices of a symmetric convex hexagon. Then

$$\{a + b,\ b + c,\ c + a\} \cap \operatorname{Co}(V) \ne \emptyset.$$

Proof. By applying a linear transformation we may assume $a = (-1, 1)$ and $b = (-1, -1)$. Without loss of generality we also assume $c = (x_0, y_0)$, where $-1 \le y_0 \le 0$ and $x_0 \ge 1$. If $y_0 \ge 2 - x_0$, then $a + b = (-2, 0) \in \operatorname{Co}(V)$, and we are done. So we may assume that $c \in \operatorname{Co}(\{(1, 0), (1, -1), (2, 0), (3, -1)\})$. ($\operatorname{Co}(\{(1, 0), (1, -1), (2, 0), (3, -1)\})$ is the shaded region in Figure 4.) This forces the quadrilateral $\operatorname{Co}(\{0, a, c, -b\})$ to contain the parallelogram $\operatorname{Co}(\{0, a, c, a + c\})$, and so

$$a + c \in \operatorname{Co}(\{0, a, c, a + c\}) \subseteq \operatorname{Co}(\{0, a, c, -b\}) \subseteq \operatorname{Co}(V). \qquad □$$

For the rest of this section we will call a norm $\|\cdot\|$ on a finite dimensional space smooth if it is a $C^\infty$ function away from the origin and the unit ball is strictly convex. A finite dimensional space is smooth iff its norm is smooth. This implies that norming linear functionals are unique.

Figure 4.

3.16. Lemma. Let $X$ be a smooth two-dimensional normed space. Suppose that $K \subseteq S_1(0)$ is a closed set and that $0 \notin \operatorname{Co}(K)$. Then $f(x) = \operatorname{dist}(x, K)$ does not attain a local maximum at $x = 0$.

Proof. As $(X, \|\cdot\|)$ is smooth, for each $u \in S_1(0)$ there is a unique linear functional $\lambda_u$ that norms $u$, the map $u \mapsto \lambda_u$ is a homeomorphism of $S_1(0)$ onto the unit sphere $S_1^*(0)$ in the dual space $(X^*, \|\cdot\|^*)$, and $\lambda_{-u} = -\lambda_u$. If $u \in S_1(0)$ then $S_1(0) \setminus \{u, -u\}$ has exactly two connected components. A closed subset $K \subseteq S_1(0)$ satisfies $0 \notin \operatorname{Co}(K)$ if and only if there is a $u \in S_1(0)$ so that $K$ is contained in one of the connected components of $S_1(0) \setminus \{u, -u\}$ (for this is equivalent to being able to separate $K$ from the origin by a linear functional). But the properties of the map $u \mapsto \lambda_u$ imply that $K$ is contained in a connected component of $S_1(0) \setminus \{-u, u\}$ if and only if $N^*(K) := \{\lambda_u : u \in K\}$ is contained in a connected component of $S_1^*(0) \setminus \{\lambda_u, -\lambda_u\}$. Therefore $0 \notin \operatorname{Co}(K)$ if and only if $0 \notin \operatorname{Co}(N^*(K))$. But by Proposition 3.4, $0 \notin \operatorname{Co}(N^*(K))$ implies that $f$ does not have a local maximum at $0$. □

Let $\varepsilon > 0$. A set $A \subseteq X$ will be said to be $\varepsilon$-separated if $\|a - b\| \ge \varepsilon$ whenever $a, b$ are distinct elements of $A$.

3.17. Lemma. Suppose that $X$ is a smooth two-dimensional normed space and that $A \subseteq X$ is $\varepsilon$-separated and approximately convex. Then $d_H(A, \operatorname{Co}(A)) \le 2$.

Proof. Let $f(x) = \operatorname{dist}(x, A)$ ($x \in X$). By Carathéodory's Theorem, it suffices to prove that if $\{d, e, f\} \subseteq A$, then $f(x) \le 2$ for all $x \in \Delta$, where $\Delta = \operatorname{Co}(\{d, e, f\})$. By continuity of $f$, there exists $x_0 \in \Delta$ at which $f$ attains its maximum. By translation we may assume without loss of generality that $x_0 = 0$. If $0 \in \partial(\Delta)$ then $0$ is on a segment between two elements of $A$, and so by restricting $f$ to this segment we see by Theorem 2.3 that $f(0) \le 2$. So we may assume that $0$ lies in the interior of $\Delta$. Let $R = f(0)$ and let $K = A \cap B_R(0)$. If $0 \notin \operatorname{Co}(K)$, then by Lemma 3.16 $g(x) = \operatorname{dist}(x, K)$ does not attain a local maximum at $x = 0$. But since $A$ is $\varepsilon$-separated an easy compactness argument yields $\operatorname{dist}(0, A \setminus K) > R$, and so $f(x) = g(x)$ for all $x$ sufficiently close to $x = 0$. Thus $f(x)$ does not attain a local maximum at $x = 0$, which contradicts the fact that $0$ lies in the interior of $\Delta$.

So we may assume that $0 \in \operatorname{Co}(K)$. By Carathéodory's Theorem there exists $\{a, b, c\} \subseteq K$ with $0 \in \operatorname{Co}(\{a, b, c\})$. Once again, we may assume that $0$ lies in the interior of $\operatorname{Co}(\{a, b, c\})$. Now $B_R(0)$ contains the convex hexagon with vertices $V = \{a, b, c, -a, -b, -c\}$. By Lemma 3.15,

$$\{a + b,\ b + c,\ c + a\} \cap \operatorname{Co}(V) \ne \emptyset.$$

Thus

$$\min\{\|a + b\|, \|b + c\|, \|c + a\|\} \le R.$$

We may assume without loss of generality that $\|a + b\| \le R$. Since $A$ is approximately convex there exists $x \in A$ with $\|x - (1/2)(a + b)\| \le 1$. Thus

$$R = \operatorname{dist}(0, A) \le \|x\| \le 1 + \frac{1}{2}\|a + b\| \le 1 + \frac{R}{2},$$

and so $R \le 2$ as required. □



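Proof of Theorem 3.14. Assume $A \subseteq X$ is approximately convex. Let $\varepsilon > 0$. There exists an equivalent smooth norm $\|\cdot\|'$ on $X$ such that

$$\|x\|' \le \|x\| \le (1 + \varepsilon)\|x\|' \qquad (x \in X).$$

Let $B \subseteq A$ be a maximal $\varepsilon$-separated subset of $A$. Then $d'_H(A, B) \le d_H(A, B) \le \varepsilon$ (here $d'_H(\cdot, \cdot)$ denotes Hausdorff distance with respect to $\|\cdot\|'$). Thus,

$$d'_H\Big(B, \frac{B + B}{2}\Big) \le d'_H(B, A) + d'_H\Big(A, \frac{A + A}{2}\Big) + d'_H\Big(\frac{A + A}{2}, \frac{B + B}{2}\Big) \le \varepsilon + 1 + \varepsilon = 1 + 2\varepsilon.$$

Lemma 3.17 applied to $\|\cdot\|'$ and $B$ yields $d'_H(B, \operatorname{Co}(B)) \le 2(1 + 2\varepsilon)$. Thus,

$$d_H(A, \operatorname{Co}(A)) \le (1 + \varepsilon)\, d'_H(A, \operatorname{Co}(A)) \le (1 + \varepsilon)\big(d'_H(B, \operatorname{Co}(B)) + 2 d'_H(A, B)\big) \le (1 + \varepsilon)\big(2(1 + 2\varepsilon) + 2\varepsilon\big).$$

Since $\varepsilon > 0$ is arbitrary, we obtain $d_H(A, \operatorname{Co}(A)) \le 2$ as desired. □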
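For completeness, the final bound in this chain can be expanded to make the limit explicit (a routine calculation, written out here only as a sketch):

```latex
(1+\varepsilon)\bigl(2(1+2\varepsilon) + 2\varepsilon\bigr)
  = (1+\varepsilon)(2 + 6\varepsilon)
  = 2 + 8\varepsilon + 6\varepsilon^{2}
  \longrightarrow 2 \quad \text{as } \varepsilon \to 0^{+}.
```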